GenCore version 5.1.9
Copyright (c) 1993 - 2006 Biocceleration Ltd. 



OM nucleic - nucleic search, using sw model 

Run on: July 22, 2006, 22:47:34 ; Search time 2065 Seconds 

(without alignments) 
6110.779 Million cell updates/sec 



Title: US-10-679-580A-1

Perfect score: 6744

Sequence: 1 ggacctcgcaggccaagaat aaaccaagaaatgcagcatt 6744

Scoring table: IDENTITY_NUC

Gapop 10.0 , Gapext 1.0



Searched: 1403666 seqs, 935554401 residues 

Total number of hits satisfying chosen parameters: 2807332



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database :       Issued_Patents_NA:*
               1:  /EMC_Celerra_SIDS3/ptodata/2/ina/1_COMB.seq:*
               2:  /EMC_Celerra_SIDS3/ptodata/2/ina/5_COMB.seq:*
               3:  /EMC_Celerra_SIDS3/ptodata/2/ina/6A_COMB.seq:*
               4:  /EMC_Celerra_SIDS3/ptodata/2/ina/6B_COMB.seq:*
               5:  /EMC_Celerra_SIDS3/ptodata/2/ina/7_COMB.seq:*
               6:  /EMC_Celerra_SIDS3/ptodata/2/ina/H_COMB.seq:*
               7:  /EMC_Celerra_SIDS3/ptodata/2/ina/PCTUS_COMB.seq:*
               8:  /EMC_Celerra_SIDS3/ptodata/2/ina/PP_COMB.seq:*
               9:  /EMC_Celerra_SIDS3/ptodata/2/ina/RE_COMB.seq:*
               10:  /EMC_Celerra_SIDS3/ptodata/2/ina/backfiles1.seq:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
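
The throughput figure in the run header can be cross-checked from the other header values. The sketch below assumes (this is not stated in the report) that one cell update is one query-residue by database-residue comparison and that both strands are searched, which is consistent with the hit total being twice the number of sequences searched. The function name is illustrative only.

```python
def million_cell_updates_per_sec(query_len, db_residues, seconds, strands=2):
    """Estimate search throughput in millions of cell updates per second.

    Assumption: one cell update = one (query residue, database residue)
    pair, counted once per searched strand.
    """
    cells = query_len * db_residues * strands
    return cells / seconds / 1e6

# Header values from this run: 6744-residue query, 935554401 database
# residues, 2065 seconds of search time.
rate = million_cell_updates_per_sec(6744, 935_554_401, 2065)
print(f"{rate:.3f} Million cell updates/sec")
```

Under these assumptions the result agrees with the 6110.779 Million cell updates/sec printed in the header.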
                  %
Result          Query
  No.    Score  Match   Length  DB  ID          Description

ALIGNMENTS



RESULT 1
US-09-512-581D-3
; Sequence 3, Application US/09512581D
; Patent No. 6994992
; GENERAL INFORMATION:
;  APPLICANT: Soto, Ana M.
;  APPLICANT: Sonnenschein, Carlos
;  APPLICANT: Geck, Peter
;  APPLICANT: Szelei, Jozsef
;  TITLE OF INVENTION: NOVEL ANDROGEN-INDUCED SUPPRESSOR OF CELL PROLIFERATION AND USES
;  TITLE OF INVENTION: THEREOF
;  FILE REFERENCE: 34724-026
;  CURRENT APPLICATION NUMBER: US/09/512,581D
;  CURRENT FILING DATE: 2000-02-24
;  PRIOR APPLICATION NUMBER: US 60/121,461
;  PRIOR FILING DATE: 1999-02-24
;  NUMBER OF SEQ ID NOS: 72
;  SOFTWARE: PatentIn version 3.3
; SEQ ID NO 3
;   LENGTH: 4173
;   TYPE: DNA
;   ORGANISM: Homo sapiens
;   FEATURE:
;   NAME/KEY: CDS
;   LOCATION: (1)..(4173)
US-09-512-581D-3

Query Match 26.9%; Score 1815.6; DB 4; Length 4173; 

Best Local Similarity 69.7%; Pred. No. 0; 

Matches 2457; Conservative 0; Mismatches 1069; Indels 0; Gaps 0; 

Qy    173 GGAAGATCGCTTACCCTCCGGGGGTAAAAGAGATCACCGACAAGATCACCACGGACGAGA 232
          | || ||  | || || || ||||| || || ||  | || || ||  | |  || ||||
Db     32 GAAAAATTACATATCCGCCTGGGGTCAAGGAAATATCAGATAAAATATCTAAAGAGGAGA 91

Qy    233 TGATCAAACGCCTGAAGATGGTAGTGAAAACCTTTATGGATATGGATCAGGACTCAGAAG 292
          || | | |||  | |||||||| |||||||| |||||||||||||| |||||||| ||||
Db     92 TGGTGAGACGATTAAAGATGGTTGTGAAAACTTTTATGGATATGGACCAGGACTCTGAAG 151

Qy    293 ATGAAAAACAGCAGTATCTCCCACTAGCCTTGCATCTTGCATCTGAATTCTTCCTCAGGA 352
          | |||||  |||  ||| |    ||||| || |||||||| || || || || |||| | 
Db    152 AAGAAAAGGAGCTTTATTTAAACCTAGCTTTACATCTTGCTTCAGATTTTTTTCTCAAGC 211

Qy    353 ACCCCAATAAAGATGTGCGTCTCCTTGTAGCATGTTGTTTGGCTGATATCTTTCGTATCT 412
          | ||   ||||||||| ||  | || ||||| || ||  | |||||||| ||  | || |
Db    212 ATCCTGGTAAAGATGTTCGCTTACTGGTAGCCTGCTGCCTTGCTGATATTTTCAGGATTT 271

Qy    413 ATGCCCCAGAAGCTCCATATACTTCCCATGATAAACTTAAGGACATATTTTTGTTTATTA 472
          |||| || |||||||| || || |||| ||||||||| ||||| |||||| ||||||| |
Db    272 ATGCTCCTGAAGCTCCTTACACATCCCCTGATAAACTAAAGGATATATTTATGTTTATAA 331

Qy    473 CCAGACAATTAAAAGGTTTGGAGGATACAAAGAGTCCACAGTTTAATAGATACTTTTATT 532
          | ||||| || || ||  | |||||||||||||| ||||| || ||||| || |||||||
Db    332 CAAGACAGTTGAAGGGGCTAGAGGATACAAAGAGCCCACAATTCAATAGGTATTTTTATT 391

Qy    533 TATTAGAGAATTTAGCTTGGGTTAAATCATATAACATCTGCTTTGAATTGGAAGATTGCA 592
          || | |||||  | |||||||| || ||||||||||| |||||||| || |||||| |||
Db    392 TACTTGAGAACATTGCTTGGGTCAAGTCATATAACATATGCTTTGAGTTAGAAGATAGCA 451

Qy    593 ATGAAATTTTTATTCAGCTTTTTAGAACTCTCTTCTCAGTGATCAACAATAGCCACAATA 652
          |||||||||| |  ||||| |  |||||  | || ||||| || |||||| |||||||| 
Db    452 ATGAAATTTTCACCCAGCTATACAGAACCTTATTTTCAGTTATAAACAATGGCCACAATC 511

Qy    653 AGAAGGTACAAATGCACATGCTAGATTTGATGAGTTCTATCATCATGGAAGGTGATGGAG 712
          |||| || || ||||||||| ||||  | ||||| ||||| ||    |||||||||  ||
Db    512 AGAAAGTCCATATGCACATGGTAGACCTTATGAGCTCTATTATTTGTGAAGGTGATACAG 571

Qy    713 TTACTCAAGAATTATTGGGCTCCATTCTTATTAACCTCATTCCTGCACATAAGAACTTAA 772
          |  |||| ||  | ||||   |  || |  | || ||  | ||||| |||||||| ||||
Db    572 TGTCTCAGGAGCTTTTGGATACGGTTTTAGTAAATCTGGTACCTGCTCATAAGAATTTAA 631
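
The summary percentages printed for this result can be reproduced from its raw counts. This is a minimal sketch, assuming Query Match is the score divided by the perfect score and Best Local Similarity is matches divided by aligned columns (matches + mismatches + indels). Both definitions are inferred from the numbers in this report, not taken from GenCore documentation, and the function names are illustrative.

```python
def query_match_pct(score, perfect_score):
    """Score as a percentage of the query's self-alignment score."""
    return 100.0 * score / perfect_score

def local_similarity_pct(matches, mismatches, indels=0):
    """Identical columns as a percentage of all aligned columns."""
    aligned = matches + mismatches + indels
    return 100.0 * matches / aligned

# Values printed for RESULT 1 (US-09-512-581D-3).
print(f"Query Match {query_match_pct(1815.6, 6744):.1f}%")
print(f"Best Local Similarity {local_similarity_pct(2457, 1069):.1f}%")
```

With the values above, both figures round to the 26.9% and 69.7% shown in the report.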
Copyright (c) 1993 - 2006 Biocceleration Ltd.



OM nucleic - nucleic search, using sw model 

Run on: July 24, 2006, 01:59:18 ; Search time 36612 Seconds 

(without alignments) 
11779.237 Million cell updates/sec 

Title: US-10-679-580A-1

Perfect score: 6744

Sequence: 1 ggacctcgcaggccaagaat aaaccaagaaatgcagcatt 6744

Scoring table: IDENTITY_NUC

Gapop 10.0 , Gapext 1.0 

Searched: 6366136 seqs, 31973710525 residues 

Total number of hits satisfying chosen parameters: 12732272 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database :       GenEmbl:*
               1:  gb_env:*
               2:  gb_pat:*
               3:  gb_ph:*
               4:  gb_pl:*
               5:  gb_pr:*
               6:  gb_ro:*
               7:  gb_sts:*
               8:  gb_sy:*
               9:  gb_un:*
               10:  gb_vi:*
               11:  gb_ov:*
               12:  gb_htg:*
               13:  gb_in:*
               14:  gb_om:*
               15:  gb_ba:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
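
The "sw model" named in the run header is Smith-Waterman local alignment, here with affine gap penalties (Gapop 10.0, Gapext 1.0). The sketch below is a generic Gotoh-style scorer, not GenCore's implementation. Two things are assumptions: the mismatch score of -0.6 is inferred from a reported result (2457 matches and 1069 mismatches giving a score of 1815.6 implies 0.6 per mismatch), and a gap of length k is charged gap_open + (k - 1) * gap_ext.

```python
def smith_waterman_affine(query, target, match=1.0, mismatch=-0.6,
                          gap_open=10.0, gap_ext=1.0):
    """Best local alignment score with affine gaps (Gotoh recurrences).

    Assumptions: identity scoring with an inferred mismatch of -0.6, and a
    gap of length k costing gap_open + (k - 1) * gap_ext.
    """
    neg_inf = float("-inf")
    n = len(target)
    h_prev = [0.0] * (n + 1)       # best score ending at (i-1, j)
    f_prev = [neg_inf] * (n + 1)   # best score ending in a vertical gap
    best = 0.0
    for i in range(1, len(query) + 1):
        h_cur = [0.0] * (n + 1)
        f_cur = [neg_inf] * (n + 1)
        e = neg_inf                # best score ending in a horizontal gap
        for j in range(1, n + 1):
            e = max(h_cur[j - 1] - gap_open, e - gap_ext)
            f_cur[j] = max(h_prev[j] - gap_open, f_prev[j] - gap_ext)
            s = match if query[i - 1] == target[j - 1] else mismatch
            h_cur[j] = max(0.0, h_prev[j - 1] + s, e, f_cur[j])
            best = max(best, h_cur[j])
        h_prev, f_prev = h_cur, f_cur
    return best

# First ten query residues against themselves, then with one substitution.
print(smith_waterman_affine("GGACCTCGCA", "GGACCTCGCA"))
print(smith_waterman_affine("GGACCTCGCA", "GGACGTCGCA"))
```

With a match score of 1, a sequence aligned to itself scores its own length, which is why the perfect score equals the query length of 6744.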
                                                CQ496390 Sequence
   24   1820.4   27.0     5297  11  BC086289    BC086289 Xenopus l
   25   1817.2   26.9     5253   5  HSU95825    U95825 Human andro
   26   1808.8   26.8     5276  11  AJ851551    AJ851551 Gallus ga
   27     1789   26.5     5317   6  AY102267    AY102267 Mus muscu
   28   1785.4   26.5     5472   6  BC096539    BC096539 Mus muscu
   29   1773.2   26.3     5154  11  BC100220    BC100220 Xenopus l
   30   1706.2   25.3     2265   2  CS169195    CS169195 Sequence
   31   1706.2   25.3     2265   2  AX835316    AX835316 Sequence
   32   1706.2   25.3     2265   5  AK098331    AK098331 Homo sapi
   33   1706.2   25.3     2705   5  BC041361    BC041361 Homo sapi
   34     1703   25.3     1936   5  AY550968    AY550968 Homo sapi
   35   1686.6   25.0     1722   2  AR379576    AR379576 Sequence
   36   1594.4   23.6     1976   6  BC021408    BC021408 Mus muscu
   37   1489.6   22.1   188061  12  AC173425    AC173425 Bos tauru
   38     1489   22.1     1489   2  BD160058    BD160058 Primer fo
   39     1489   22.1     1489   2  AX882999    AX882999 Sequence
   40     1489   22.1     1489   5  AK023592    AK023592 Homo sapi
   41   1357.8   20.1     5944   6  AK122414    AK122414 Mus muscu
   42    979.2   14.5     3632  11  AY695732    AY695732 Xenopus l
   43      918   13.6     1852   5  HSU50533    U50533 Human BRCA2
   44      806   12.0     4252   5  BC039256    BC039256 Homo sapi
   45    761.2   11.3   110000  12  AC156034_2  Continuation (3 of




ALIGNMENTS 



RESULT 1 
AF294791
LOCUS       AF294791                6744 bp    mRNA    linear   PRI 22-MAR-2004
DEFINITION  Homo sapiens SCC-112 (SCC-112) mRNA, complete cds.
ACCESSION   AF294791
VERSION     AF294791.1  GI:21951801
KEYWORDS
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini;
            Hominidae; Homo.
REFERENCE   1  (bases 1 to 6744)
  AUTHORS   Kumar,D., Sakabe,I., Patel,S., Zhang,Y., Ahmad,I., Gehan,E.A.,
            Whiteside,T.L. and Kasid,U.
  TITLE     SCC-112, a novel cell cycle-regulated molecule, exhibits reduced
            expression in human renal carcinomas
  JOURNAL   Gene 328, 187-196 (2004)
   PUBMED   15019998
REFERENCE   2  (bases 1 to 6744)
  AUTHORS   Kumar,D. and Kasid,U.
  TITLE     Direct Submission
  JOURNAL   Submitted (09-AUG-2000) Radiation, Medicine, 3970 Reservoir Rd, NW,
            Washington, DC 20007, USA
FEATURES             Location/Qualifiers
     source          1..6744
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /chromosome="4"
     gene            1..6744
                     /gene="SCC-112"
     CDS             232..4125
                     /gene="SCC-112"
                     /codon_start=1
                     /product="SCC-112"
                     /protein_id="AAM82347.1"
                     /db_xref="GI:21951802"
                     /translation="MIKRLKMVVKTFMDMDQDSEDEKQQYLPLALHLASEFFLRNPNK
                     DVRLLVACCLADIFRIYAPEAPYTSHDKLKDIFLFITRQLKGLEDTKSPQFNRYFYLL
                     ENLAWVKSYNICFELEDCNEIFIQLFRTLFSVINNSHNKKVQMHMLDLMSSIIMEGDG
                     VTQELLGSILINLIPAHKNLNKQSFDLAKVLLKRTVQTIEACIANFFNQVLVLGRSSV
                     SDLSEHVFDLIQELFAIDPHLLLSVMPQLEFKLKSNDGEERLAVVRLLAKLFGSKDSD
                     LATQNRPLWQCFLGRFNDIHVPVRLESVKFASHCLMNHPDLAKDLTEYLKVRSHDPEE
                     AIRHDVIVTIITAAKRDLALVNDQLLGFVRERTLDKRWRVRKEAMMGLAQLYKKYCLH
                     GEAGKEAAEKVSWIKDKLLHIYYQNSIDDKLLVEKIFAQYLVPHNLETEERMKCLYYL
                     YASLDPNAVKALNEMWKCQNMLRSHVRELLDLHKQPTSEANCSAMFGKLMTIAKNLPD
                     PGKAQDFVKKFNQVLGDDEKLRSQLELLISPTCSCKQADICVREIARKLANPKQPTNP
                     FLEMVKFLLERIAPVHIDSEAISALVKLMNKSIEGTADDEEEGVSPDTAIRSGLELLK
                     VLSFTHPTSFHSAETYESLLQCLRMEDDKVAEAAIQIFRNTGHKIETDLPQIRSTLIP
                     ILHQKAKRGTPHQAKQAVHCIHAIFTNKEVQLAQIFEPLSRSLNADVPEQLITPLVSL
                     GHISMLAPDQFASPMKSWANFIVKDLLMNDRSTGEKNGKLWSPDEEVSPEVLAKVQA
                     IKLLVRWLLGMKNNQSKSANSTLRLLSAMLVSEGDLTEQKRISKSDMSRLRLAAGSAI
                     MKLAQEPCYHEIITPEQFQLCALVINDECYQVRQIFAQKLHKALVKLLLPLEYMAIFA
                     LCAKDPVKERRAHARQCLLKNISIRREYIKQNPMATEKLLSLLPEYVVPYMIHLLAHD
                     PDFTRSQDVDQLRDIKECLWFMLEVLMTKNENNSHAFMKKMAENIKLTRDAQSPDESK
                     TNEKLYTVCDVALCVINSKSALCNADSPKDPVLPMKFFTQPEKDFCNDKSYISEETRV
                     LLLTGKPKPAGVLGAVNKPLSATGRKPYVRSTGTETGSNINVNSELNPSTGNRSREQS
                     SEAAETGVSENEENPVRIISVTPVKNIDPVKNKEINSDQATQGNISSDRGKKRTVTAA
                     GAENIQQKTDEKVDESGPPAPSKPRRGRRPKSESQGNATKNDDLNKPINKGRKRAAVG
                     QESPGGLEAGNAKAPKLQDLAKKAAPAERQIDLQR"

ORIGIN 
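
The CDS coordinates in the feature table determine how long the encoded protein should be. The sketch below assumes inclusive 1-based coordinates (the usual feature-table convention) and that the last codon of the CDS is a stop codon that does not appear in the translation. The helper name is illustrative.

```python
def cds_protein_length(start, end, has_stop=True):
    """Number of amino acids encoded by an inclusive, 1-based CDS range."""
    nt = end - start + 1
    if nt % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = nt // 3
    return codons - 1 if has_stop else codons

# CDS 232..4125 from the record above: 3894 nt, 1298 codons.
print(cds_protein_length(232, 4125))
```

For CDS 232..4125 this gives 1297 residues, a useful check on whether a translation has survived extraction intact.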



Query Match 100.0%; Score 6744; DB 5; Length 6744; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 6744; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy      1 GGACCTCGCAGGCCAAGAATTCGGCACGAGGGGCGCCGGCTCCCGGGGCACGGACGGCCG 60
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db      1 GGACCTCGCAGGCCAAGAATTCGGCACGAGGGGCGCCGGCTCCCGGGGCACGGACGGCCG 60

Qy     61 GGCGCGCGCCTCTGCGAGGGGCGTCCGGGTCCGAGTCGGCGGTCCGGGCCGGCGCGAGGT 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     61 GGCGCGCGCCTCTGCGAGGGGCGTCCGGGTCCGAGTCGGCGGTCCGGGCCGGCGCGAGGT 120

Qy    121 GCGTGCGGGCGGGCCGCGGGGGTCCCGGACGGACACAAGCGCACACACTCCCGGAAGATC 180
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    121 GCGTGCGGGCGGGCCGCGGGGGTCCCGGACGGACACAAGCGCACACACTCCCGGAAGATC 180

Qy    181 GCTTACCCTCCGGGGGTAAAAGAGATCACCGACAAGATCACCACGGACGAGATGATCAAA 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    181 GCTTACCCTCCGGGGGTAAAAGAGATCACCGACAAGATCACCACGGACGAGATGATCAAA 240

Qy    241 CGCCTGAAGATGGTAGTGAAAACCTTTATGGATATGGATCAGGACTCAGAAGATGAAAAA 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    241 CGCCTGAAGATGGTAGTGAAAACCTTTATGGATATGGATCAGGACTCAGAAGATGAAAAA 300

Qy    301 CAGCAGTATCTCCCACTAGCCTTGCATCTTGCATCTGAATTCTTCCTCAGGAACCCCAAT 360
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    301 CAGCAGTATCTCCCACTAGCCTTGCATCTTGCATCTGAATTCTTCCTCAGGAACCCCAAT 360

Qy    361 AAAGATGTGCGTCTCCTTGTAGCATGTTGTTTGGCTGATATCTTTCGTATCTATGCCCCA 420
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    361 AAAGATGTGCGTCTCCTTGTAGCATGTTGTTTGGCTGATATCTTTCGTATCTATGCCCCA 420

Qy    421 GAAGCTCCATATACTTCCCATGATAAACTTAAGGACATATTTTTGTTTATTACCAGACAA 480
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    421 GAAGCTCCATATACTTCCCATGATAAACTTAAGGACATATTTTTGTTTATTACCAGACAA 480

Qy    481 TTAAAAGGTTTGGAGGATACAAAGAGTCCACAGTTTAATAGATACTTTTATTTATTAGAG 540
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    481 TTAAAAGGTTTGGAGGATACAAAGAGTCCACAGTTTAATAGATACTTTTATTTATTAGAG 540

Qy    541 AATTTAGCTTGGGTTAAATCATATAACATCTGCTTTGAATTGGAAGATTGCAATGAAATT 600
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    541 AATTTAGCTTGGGTTAAATCATATAACATCTGCTTTGAATTGGAAGATTGCAATGAAATT 600

Qy    601 TTTATTCAGCTTTTTAGAACTCTCTTCTCAGTGATCAACAATAGCCACAATAAGAAGGTA 660
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    601 TTTATTCAGCTTTTTAGAACTCTCTTCTCAGTGATCAACAATAGCCACAATAAGAAGGTA 660

Qy    661 CAAATGCACATGCTAGATTTGATGAGTTCTATCATCATGGAAGGTGATGGAGTTACTCAA 720
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    661 CAAATGCACATGCTAGATTTGATGAGTTCTATCATCATGGAAGGTGATGGAGTTACTCAA 720

Qy    721 GAATTATTGGGCTCCATTCTTATTAACCTCATTCCTGCACATAAGAACTTAAATAAACAG 780
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    721 GAATTATTGGGCTCCATTCTTATTAACCTCATTCCTGCACATAAGAACTTAAATAAACAG 780

Qy    781 TCCTTTGACCTTGCAAAAGTGCTATTGAAAAGAACAGTCCAGACTATTGAGGCATGCATT 840
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    781 TCCTTTGACCTTGCAAAAGTGCTATTGAAAAGAACAGTCCAGACTATTGAGGCATGCATT 840

Qy    841 GCCAATTTTTTCAATCAAGTCCTGGTGCTGGGAAGATCATCAGTAAGTGATTTGTCAGAA 900
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    841 GCCAATTTTTTCAATCAAGTCCTGGTGCTGGGAAGATCATCAGTAAGTGATTTGTCAGAA 900

Qy    901 CATGTATTTGATCTGATTCAGGAACTTTTTGCTATAGATCCTCATTTATTATTATCCGTC 960
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    901 CATGTATTTGATCTGATTCAGGAACTTTTTGCTATAGATCCTCATTTATTATTATCCGTC 960

Qy    961 ATGCCACAGCTTGAATTCAAACTAAAGAGCAATGATGGAGAAGAGCGATTAGCTGTTGTT 1020
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    961 ATGCCACAGCTTGAATTCAAACTAAAGAGCAATGATGGAGAAGAGCGATTAGCTGTTGTT 1020

Qy   1021 CGACTTCTAGCTAAATTGTTTGGCTCCAAAGATTCTGATTTGGCAACACAGAATCGTCCT 1080
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1021 CGACTTCTAGCTAAATTGTTTGGCTCCAAAGATTCTGATTTGGCAACACAGAATCGTCCT 1080

Qy   1081 CTTTGGCAATGTTTTCTTGGACGATTTAATGATATTCATGTTCCTGTGAGATTAGAAAGT 1140
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1081 CTTTGGCAATGTTTTCTTGGACGATTTAATGATATTCATGTTCCTGTGAGATTAGAAAGT 1140

Qy   1141 GTGAAATTTGCCAGTCATTGTTTAATGAATCACCCAGATTTAGCGAAGGATCTCACAGAA 1200
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1141 GTGAAATTTGCCAGTCATTGTTTAATGAATCACCCAGATTTAGCGAAGGATCTCACAGAA 1200

Qy   1201 TATTTAAAGGTTAGATCACATGATCCAGAAGAAGCTATTCGTCATGATGTCATTGTTACT 1260
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1201 TATTTAAAGGTTAGATCACATGATCCAGAAGAAGCTATTCGTCATGATGTCATTGTTACT 1260

Qy   1261 ATAATAACAGCTGCCAAGAGGGACCTGGCCTTAGTAAATGATCAGCTGCTTGGCTTTGTA 1320
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1261 ATAATAACAGCTGCCAAGAGGGACCTGGCCTTAGTAAATGATCAGCTGCTTGGCTTTGTA 1320

Qy   1321 AGGGAAAGAACACTGGATAAACGGTGGCGAGTAAGAAAAGAAGCTATGATGGGTCTGGCT 1380
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1321 AGGGAAAGAACACTGGATAAACGGTGGCGAGTAAGAAAAGAAGCTATGATGGGTCTGGCT 1380

Qy   1381 CAGCTTTATAAGAAATACTGTCTTCATGGTGAAGCAGGAAAGGAAGCTGCAGAGAAAGTC 1440
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1381 CAGCTTTATAAGAAATACTGTCTTCATGGTGAAGCAGGAAAGGAAGCTGCAGAGAAAGTC 1440

Qy   1441 AGCTGGATAAAGGACAAACTTCTGCATATTTATTATCAGAACAGCATTGACGACAAACTG 1500
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1441 AGCTGGATAAAGGACAAACTTCTGCATATTTATTATCAGAACAGCATTGACGACAAACTG 1500

Qy   1501 TTGGTAGAGAAAATCTTTGCTCAGTATCTTGTCCCCCACAACCTGGAAACAGAAGAGAGA 1560
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1501 TTGGTAGAGAAAATCTTTGCTCAGTATCTTGTCCCCCACAACCTGGAAACAGAAGAGAGA 1560

Qy   1561 ATGAAATGCTTATATTACTTATATGCTAGTTTGGATCCAAATGCTGTAAAAGCTCTCAAC 1620
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1561 ATGAAATGCTTATATTACTTATATGCTAGTTTGGATCCAAATGCTGTAAAAGCTCTCAAC 1620

Qy   1621 GAAATGTGGAAGTGTCAGAACATGCTTCGGAGCCATGTACGCGAACTATTGGATTTGCAC 1680
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1621 GAAATGTGGAAGTGTCAGAACATGCTTCGGAGCCATGTACGCGAACTATTGGATTTGCAC 1680

Qy   1681 AAGCAGCCTACATCAGAGGCTAACTGTTCTGCCATGTTTGGAAAACTGATGACCATAGCA 1740
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1681 AAGCAGCCTACATCAGAGGCTAACTGTTCTGCCATGTTTGGAAAACTGATGACCATAGCA 1740

Qy   1741 AAGAATTTGCCTGACCCCGGGAAAGCACAAGATTTTGTGAAGAAATTTAACCAGGTTCTC 1800
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1741 AAGAATTTGCCTGACCCCGGGAAAGCACAAGATTTTGTGAAGAAATTTAACCAGGTTCTC 1800

Qy   1801 GGCGATGATGAGAAACTTCGGTCTCAGTTGGAGTTATTAATTAGCCCAACCTGTTCTTGC 1860

This page gives you Search Results detail for the Application 10679580 and Search Result us-10-679-580a-1.rng.


GenCore version 5.1.9 
Copyright (c) 1993 - 2006 Biocceleration Ltd. 



OM nucleic - nucleic search, using sw model 
Run on:         July 24, 2006, 00:55:58 ; Search time 3752 Seconds
                (without alignments)
                12532.208 Million cell updates/sec

Title:          US-10-679-580A-1
Perfect score:  6744
Sequence:       1 ggacctcgcaggccaagaat aaaccaagaaatgcagcatt 6744

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       5244920 seqs, 3486124231 residues

Total number of hits satisfying chosen parameters: 10489840

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries



Database :       N_Geneseq_8:*
               1:  geneseqn1980s:*
               2:  geneseqn1990s:*
               3:  geneseqn2000s:*
               4:  geneseqn2001as:*
               5:  geneseqn2001bs:*
               6:  geneseqn2002as:*
               7:  geneseqn2002bs:*
               8:  geneseqn2003as:*
               9:  geneseqn2003bs:*
               10:  geneseqn2003cs:*
               11:  geneseqn2003ds:*
               12:  geneseqn2004as:*
               13:  geneseqn2004bs:*
               14:  geneseqn2005s:*
               15:  geneseqn2006s:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution.

SUMMARIES
                  %
Result          Query
  No.    Score  Match   Length  DB  ID          Description
    1     6744  100.0     6744   8  ABX14056    Abx14056 cDNA enco
    2     6744  100.0     6744  14  ADX06262    Adx06262 Cyclin-de
    3   6555.6   97.2     6956   8  ACC72031    ACC72031 BCU0720 g
    4   5385.8   79.9     5469   6  ABQ99271    Abq99271 Human cod
    5     5177   76.8     5177  14  AED18329    Aed18329 Fibrotic
    6     5177   76.8     5177  15  AEF74956    Aef74956 Human pol
    7   3875.4   57.5     3957   3  AAA47423    Aaa47423 Sequence
    8     2198   32.6     2212   4  AAH17132    Aah17132 Human cDN
    9   2128.4   31.6     2295   4  AAS02396    Aas02396 Human sec
   10   1983.2   29.4     2002   4  AAH14767    Aah14767 Human cDN
   11   1953.4   29.0     2496   6  ABK35358    Abk35358 Human cDN
   12   1823.6   27.0     7473   5  ABV25469    Abv25469 Human pro
   13     1822   27.0     5309  15  AEF74563    Aef74563 Human pol
   14     1822   27.0     7444  13  ADR25694    Adr25694 Breast ca
   15     1822   27.0     7473   5  ABV22430    Abv22430 Human pro
   16     1822   27.0     7473   5  ABV28244    Abv28244 Human pro
   17   1815.6   26.9     5271   3  AAA28051    Aaa28051 Human and
   18   1815.6   26.9     5355   3  AAA28052    Aaa28052 Human and
   19   1706.2   25.3     2265  11  ADM03755    Adm03755 Human cDN
   20   1706.2   25.3     2265  14  AEC86685    Aec86685 Human cDN
   21   1686.6   25.0     1722  11  ADI30795    Adi30795 Human cDN
   22   1686.6   25.0     1722  13  ADS82862    Ads82862 Human lym
   23     1489   22.1     1489   4  AAH18066    Aah18066 Human cDN
   24   1148.8   17.0     1217   3  AAF18338    Aaf18338 Lung canc
   25    726.8   10.8      785   4  AAH07766    Aah07766 Human cDN
   26    686.4   10.2     1104   6  ABS56684    Abs56684 Euchromos
   27    670.4    9.9     4128  13  ADR08232    Adr08232 Full leng
   28    666.8    9.9      772   4  AAH03894    Aah03894 Human cDN
   29    593.2    8.8      663   4  AAH07307    Aah07307 Human cDN
c  30    581.4    8.6      584   6  ABK44862    Abk44862 cDNA enco
c  31    478.6    7.1      542   4  AAH09415    Aah09415 Human cDN
c  32    462.2    6.9      512   4  AAH12296    Aah12296 Human cDN
   33    437.6    6.5      452   3  AAC30259    Aac30259 Human sec
c  34    430.8    6.4      495   4  AAH12739    Aah12739 Human cDN
c  35    429.8    6.4      471   9  ACH16302    Ach16302 Human adu
   36    417.2    6.2      434   8  ABZ17884    Abz17884 S2 subtra
   37      408    6.0      408   6  ABV95661    Abv95661 Human pan
c  38    389.2    5.8      391   6  ABL38139    Abl38139 Human col
   39      381    5.6      413   9  ACH49024    Ach49024 Human leu
   40    376.8    5.6      422   9  ACH48906    Ach48906 Human leu
c  41    360.2    5.3      376   6  ABL37597    Abl37597 Human col
   42      359    5.3     2416  12  ADO35588    Ado35588 Novel mou
c  43    346.2    5.1      706  10  ADD35009    Add35009 Mouse mit
   44    335.2    5.0      380  14  AEE11851    Aee11851 Hamster c
   45    335.2    5.0      380  14  AEE15493    Aee15493 Hamster S



ALIGNMENTS 



RESULT 1 
ABX14056 

ID   ABX14056 Standard; cDNA; 6744 BP.
XX
AC   ABX14056;
XX
DT   24-FEB-2003  (first entry)
XX
DE   cDNA encoding human SCC-112 tumour supressor gene.
XX
KW   ss; gene; human; chromosome 4p14; SCC-112; cancer; apoptosis; allodynia;
KW   degenerative disorder; metastasis inhibition; breast cancer; causalgia;
KW   kidney cancer; bladder cancer; pancreatic cancer; colon cancer;
KW   squamous cell carcinoma; head trauma; spinal cord injury; herpes zoster;
KW   global and focal ischaemic and haemorrhagic stroke; epilepsy; neuralgia;
KW   hypoxia-induced nerve cell damage; anxiety; diabetes mellitus;
KW   cardiac arrest; spinal cord lesion; stomach cancer; lung cancer;
KW   neonatal distress; Alzheimer's disease; uterine cancer; colon cancer;
KW   multiple sclerosis; phantom limb pain; hyperalgesia; Down's syndrome;
KW   Huntington's disease; Parkinson's disease; Korsakoff's syndrome;
KW   amyotrophic lateral sclerosis; cell survival; cell proliferation;
KW   tumour supressor.
XX
OS   Homo sapiens.
XX
FH   Key             Location/Qualifiers
FT   CDS             232..4125
FT                   /*tag=  a
FT                   /product= "Human SCC-112"
FT   polyA_signal    6721..6726
FT                   /*tag=  b
FT                   /standard_name= "PolyA signal"
XX
PN   WO200281641-A2
XX
PD   17-OCT-2002.
XX
PF   08-APR-2002; 2002WO-US010850.
XX
PR   06-APR-2001; 2001US-0281780P.
XX
PA   (GEOU ) UNIV GEORGETOWN.
XX
PI   Kasid UN, Kumar D, Ahmad I;
XX
DR   WPI; 2003-103330/09.
DR   P-PSDB; ABG72803.
XX
PT   New isolated SSC (undefined) tumor suppressor polypeptides and
PT   polynucleotides, useful for diagnosing, preventing or treating cancer or
PT   degenerative disease, e.g. Alzheimer's Disease, Huntington's disease, or
PT   multiple sclerosis.
XX
PS   Claim 1; Fig 1B; 83pp; English.
XX
CC   The invention relates to a new isolated polypeptide SCC-112. The SCC-112
CC   polypeptides and polynucleotides are useful for diagnosing, preventing or
CC   treating cancer (e.g. breast cancer, kidney cancer, bladder cancer,
CC   pancreatic cancer, colon cancer, squamous cell carcinoma, uterine cancer,
CC   stomach cancer, colon cancer, lung cancer); or degenerative disease or
CC   disorder (e.g. global and focal ischaemic and haemorrhagic stroke, head
CC   trauma, spinal cord injury, hypoxia-induced nerve cell damage, nerve cell
CC   damage caused by cardiac arrest or neonatal distress, epilepsy, anxiety,
CC   diabetes mellitus, multiple sclerosis, phantom limb pain, causalgia,
CC   neuralgias, herpes zoster, spinal cord lesions, hyperalgesia, allodynia,
CC   Alzheimer's disease, Huntington's disease, Parkinson's disease,
CC   amyotrophic lateral sclerosis, Down's syndrome and Korsakoff's syndrome).
CC   The polypeptides and polynucleotides are also useful for inducing
CC   apoptosis in cancer cells, increasing survival or proliferation of a
CC   cell, or inhibiting cancer cell proliferation and/or metastasis in a
CC   cancer patient. The polynucleotides can be used as probes to detect
CC   complementary nucleotide sequences, or as primers to obtain additional
CC   copies of the polynucleotides. SSC-112 may also be used for identifying
CC   drugs for treatment of cancers. The present sequence represents cDNA
CC   encoding the human SCC-112 tumour supressor gene which is located on
CC   chromosome 4p14
XX

SQ Sequence 6744 BP; 2156 A; 1243 C; 1400 G; 1945 T; 0 U; 0 Other; 
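
The SQ line carries its own consistency check: the per-base counts should add up to the declared length. The sketch below parses the line with a regular expression and also derives GC content. The parser is illustrative and assumes the field layout shown in this record rather than any formal Geneseq specification.

```python
import re

def parse_sq_line(line):
    """Return (declared_length, base_counts) from an SQ summary line."""
    total = int(re.search(r"Sequence\s+(\d+)\s+BP", line).group(1))
    counts = {base: int(n)
              for n, base in re.findall(r"(\d+)\s+(A|C|G|T|U|Other);", line)}
    return total, counts

sq = "SQ Sequence 6744 BP; 2156 A; 1243 C; 1400 G; 1945 T; 0 U; 0 Other;"
total, counts = parse_sq_line(sq)
gc_pct = 100.0 * (counts["G"] + counts["C"]) / total

print(sum(counts.values()) == total)
print(f"GC content {gc_pct:.1f}%")
```

Here the four base counts sum to 6744, matching the declared length.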

Query Match 100.0%; Score 6744; DB 8; Length 6744;

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 6744; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 GGACCTCGCAGGCCAAGAATTCGGCACGAGGGGCGC CGGCTCCCGGGGCACGGACGGCCG 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 GGACCTCGCAGGCCAAGAATTCGGCACGAGGGGCGC CGGCTCCCGGGGCACGGACGGCCG 60 

Qy 61 G GC GCGC GC CT CT GC GAGG GG CGT CC GG GT CC GAGT CGGC GGTC CGGG CC GG CG CG AGGT 12 0 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 GGCGCGCGCCTCTGCGAGGGGCGTCCGGGTCCGAGTCGGCGGTCCGGGCCGGCGCGAGGT 12 0 

Qy 121 GCGTGCGGGCGGGCCGCGGGGGTCCCGGACGGACACAAGCGCACACACTCCCGGAAGATC 18 0 

I II II II M II II II II II II III II II INI II II II Mil II II II II II II MM II 

Db 121 GCGTGCGGGCGGGCCGCGGGGGTCCCGGACGGACACAAGCGCACACACTCCCGGAAGATC 18 0 

Qy 181 GCTTACCCTCCGGGGGTAAAAGAGATCACCGACAAGATCACCACGGACGAGATGATCAAA 24 0 

I II II II II II II M II II II III II II M II M II II II II M II II II II II II II II 

Db 181 G CT TACC CT CC GG GGGT AA AAGAG AT CA CC GACAAG AT CA CC AC GG AC GA GATG AT CA AA 24 0 

Qy 241 C GC CT GA AG AT GG TA GT GAAAACC TT TATG GATATG GATC AG GA CT CA GAAG AT GAAAAA 300 

I II llllllll II !l II M II Mill II IIMIIMIIII MM INI II II MM MM 

Db 241 C GC CT GAAG AT GG TA GT GAAAACC TT TATG GATA TG GAT C AG GA CT CA GAAG AT GAAAAA 3 00 

Qy 3 01 CAGCAGTATCTCCCACTAGCCTTG CATCTTGCAT CTGAATTCTTCCTCAGGAACCCCAAT 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3 01 C AG CAGT AT CT CC CA CT AG CC TTG CATC TT GC AT CT GA AT TC TT CC TC AGGAAC CC CAAT 36 0 

Qy 3 61 AAAGATGTGCGTCTCCTTGTAGCATGTTGTTTGG CTGATATCTTTCGTAT CTATGCCCCA 42 0 

I II IMIMM II II MM M III MM MINI II MMM II II MM M II MM II 

Db 3 61 AAAGATGTGCGTCTCCTTGTAGCATGTTGTTTGG CTGATATCTTTCGTAT CTATGCCCCA 42 0 

Qy 4 21 GAAGCTCCATATACTTCCCATGATAAACTTAAGGACATATTTTTGTTTATTACCAGACAA 48 0 

Ml II II II II II II II II II III II M II II II M II II II II II II M II II II II M 

Db 4 21 GAAGCTCCATATACTTCCCATGATAAACTTAAGGACATATTTTTGTTTATTACCAGACAA 48 0 

Qy 4 81 T TAAAAG GT TT GG AG GA TA CAAAG AG TC CA CAGT TT AATA GATA CT TT TA TT TATT AG AG 54 0 

I II M II M II II M M II M III II II II II Mil II II II II II II M II II II II II 

Db 4 81 TTAAAAGGTTTGGAGGATACAAAGAGTCCACAGTTTAATAGATACTTTTATTTATTAGAG 54 0 

Qy 541 AATTTAGCTTGGGTTAAATCATATAACATCTGCTTTGAATTGGAAGATTGCAATGAAATT 60 0 

I II II II II M II II MM II III II II II II II MM II II M II II II II M II II M 

Db 541 AAT TT AG CT TG GG TT AAAT CATAT AA CATC TG CT TT GAAT TGGAAG AT TG CAAT GAAATT 600 

Qy 6 01 TTTATTCAG CTTTTTAGAACTCTCTTCT CAGT GATC AA CAAT AG CC AC AATAAGAAGGTA 66 0 

I II M II II II II M II II II III II II II II II II II II II II M II II M M II II II 

Db 6 01 TTTATTCAG CTTT TT AG AA CT CTCTTCTCAGTGATCAA CAAT AG CC AC AATAAGAAGGTA 66 0 
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Qy 6 61 C AAAT GC AC AT GC TAGATT TG ATG AGTT CT AT CATC AT GG AAGG TG ATGG AG TT ACTC AA 72 0 

Mill II II II II MINI Mill II MINI II II INI II II II II II II II II II II 

Db 6 61 CAAATGCACATGCTAGATTTGATG AGTT CTATCATC ATGG AAGGTGATGGAGTTACTCAA 72 0 

Qy 7 21 GAATTATTGGGCTCCATTCTT ATT AACCTCATTCCTGCACATAAGAACTT AAAT AAACAG 78 0 

I II II II II II II III I II II III II II MM II II MM II II II M MM II II II II 

Db 721 GAATTATTGGGCTCCATTCTT ATT AACCTCATTCCTGCACATAAGAACTT AAAT AAACAG 78 0 

Qy 781 TCCTTTGACCTTGCAAAAGTGCTATTGAAAAGAACAGTCCAGACTATTGAGGCATGCATT 84 0 

I II MM II M II II II II II III II II II II II II MM II II II II II II MM II II 

Db 7 81 T CC TT TG AC CT TG CAAAAG TG CTA TT GAAAAG AA CAGT CC AG AC TATTGA GG CATG CATT 84 0 

Qy 841 G CC AATTTT TT CAAT CAAGTC CTG GT GCTGGG AAGATC AT CAGT AAGTGATT TGTC AG AA 90 0 

I II MM II M II II II II II III II II II II II II II II II II II II IIIIMIIM II 

Db 841 GCCAATTTTTTCAATCAAGTCCTGGTGCTGGGAAGATCATCAGTAAGTGATTTGTCAGAA 90 0 

Qy 9 01 CATGTATTTGATCTGATTCAGGAACTTTTTGCTATAGATCCTCATTTATTATTATCCGTC 96 0 

I II II II II II II II II II II III II MM II II M II II II II II II II II II II II II 

Db 9 01 CATGTATTTGATCTGATTCAGGAACTTTTTGCTATAGATCCTCATTTATTATTATCCGTC 96 0 

Qy 9 61 ATGCCACAGCTTGAATTCAAACTAAAGAGCAATGATGGAGAAGAGCGATTAGCTGTTGTT 10 20 

I II II II II II II II II M II III II MM II II M II II II II M II II II II II M II 

Db 9 61 ATG CC AC AG CT TG AATT CA AA CTA AAGAGC AATG AT GG AG AAGAGC GATT AG CTGTTG TT 10 2 0 

Qy       1021 CGACTTCTAGCTAAATTGTTTGGCTCCAAAGATTCTGATTTGGCAACACAGAATCGTCCT 1080
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1021 CGACTTCTAGCTAAATTGTTTGGCTCCAAAGATTCTGATTTGGCAACACAGAATCGTCCT 1080

Qy       1081 CTTTGGCAATGTTTTCTTGGACGATTTAATGATATTCATGTTCCTGTGAGATTAGAAAGT 1140
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1081 CTTTGGCAATGTTTTCTTGGACGATTTAATGATATTCATGTTCCTGTGAGATTAGAAAGT 1140

Qy       1141 GTGAAATTTGCCAGTCATTGTTTAATGAATCACCCAGATTTAGCGAAGGATCTCACAGAA 1200
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1141 GTGAAATTTGCCAGTCATTGTTTAATGAATCACCCAGATTTAGCGAAGGATCTCACAGAA 1200

Qy       1201 TATTTAAAGGTTAGATCACATGATCCAGAAGAAGCTATTCGTCATGATGTCATTGTTACT 1260
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1201 TATTTAAAGGTTAGATCACATGATCCAGAAGAAGCTATTCGTCATGATGTCATTGTTACT 1260

Qy       1261 ATAATAACAGCTGCCAAGAGGGACCTGGCCTTAGTAAATGATCAGCTGCTTGGCTTTGTA 1320
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1261 ATAATAACAGCTGCCAAGAGGGACCTGGCCTTAGTAAATGATCAGCTGCTTGGCTTTGTA 1320

Qy       1321 AGGGAAAGAACACTGGATAAACGGTGGCGAGTAAGAAAAGAAGCTATGATGGGTCTGGCT 1380
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1321 AGGGAAAGAACACTGGATAAACGGTGGCGAGTAAGAAAAGAAGCTATGATGGGTCTGGCT 1380

Qy       1381 CAGCTTTATAAGAAATACTGTCTTCATGGTGAAGCAGGAAAGGAAGCTGCAGAGAAAGTC 1440
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1381 CAGCTTTATAAGAAATACTGTCTTCATGGTGAAGCAGGAAAGGAAGCTGCAGAGAAAGTC 1440

Qy       1441 AGCTGGATAAAGGACAAACTTCTGCATATTTATTATCAGAACAGCATTGACGACAAACTG 1500
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1441 AGCTGGATAAAGGACAAACTTCTGCATATTTATTATCAGAACAGCATTGACGACAAACTG 1500

Qy       1501 TTGGTAGAGAAAATCTTTGCTCAGTATCTTGTCCCCCACAACCTGGAAACAGAAGAGAGA 1560
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1501 TTGGTAGAGAAAATCTTTGCTCAGTATCTTGTCCCCCACAACCTGGAAACAGAAGAGAGA 1560

Qy       1561 ATGAAATGCTTATATTACTTATATGCTAGTTTGGATCCAAATGCTGTAAAAGCTCTCAAC 1620
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1561 ATGAAATGCTTATATTACTTATATGCTAGTTTGGATCCAAATGCTGTAAAAGCTCTCAAC 1620

Qy       1621 GAAATGTGGAAGTGTCAGAACATGCTTCGGAGCCATGTACGCGAACTATTGGATTTGCAC 1680
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1621 GAAATGTGGAAGTGTCAGAACATGCTTCGGAGCCATGTACGCGAACTATTGGATTTGCAC 1680

Qy       1681 AAGCAGCCTACATCAGAGGCTAACTGTTCTGCCATGTTTGGAAAACTGATGACCATAGCA 1740
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1681 AAGCAGCCTACATCAGAGGCTAACTGTTCTGCCATGTTTGGAAAACTGATGACCATAGCA 1740

Qy       1741 AAGAATTTGCCTGACCCCGGGAAAGCACAAGATTTTGTGAAGAAATTTAACCAGGTTCTC 1800
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1741 AAGAATTTGCCTGACCCCGGGAAAGCACAAGATTTTGTGAAGAAATTTAACCAGGTTCTC 1800

Qy       1801 GGCGATGATGAGAAACTTCGGTCTCAGTTGGAGTTATTAATTAGCCCAACCTGTTCTTGC 1860
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1801 GGCGATGATGAGAAACTTCGGTCTCAGTTGGAGTTATTAATTAGCCCAACCTGTTCTTGC 1860

Qy       1861 AAACAAGCAGATATTTGTGTGAGAGAAATAGCCCGGAAACTTGCAAATCCTAAGCAACCA 1920
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1861 AAACAAGCAGATATTTGTGTGAGAGAAATAGCCCGGAAACTTGCAAATCCTAAGCAACCA 1920

Qy       1921 ACAAATCCTTTTCTAGAGATGGTCAAATTTCTGTTGGAAAGAATCGCACCTGTGCACATT 1980
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1921 ACAAATCCTTTTCTAGAGATGGTCAAATTTCTGTTGGAAAGAATCGCACCTGTGCACATT 1980

Qy       1981 GATTCAGAAGCCATAAGTGCACTAGTGAAATTGATGAATAAGTCAATAGAGGGGACAGCA 2040
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1981 GATTCAGAAGCCATAAGTGCACTAGTGAAATTGATGAATAAGTCAATAGAGGGGACAGCA 2040

Qy       2041 GATGATGAAGAGGAGGGTGTAAGTCCAGATACAGCTATCCGTTCAGGACTTGAACTTCTT 2100
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2041 GATGATGAAGAGGAGGGTGTAAGTCCAGATACAGCTATCCGTTCAGGACTTGAACTTCTT 2100

Qy       2101 AAGGTTCTGTCTTTTACACATCCTACCTCGTTCCACTCTGCAGAGACATATGAGTCCTTG 2160
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2101 AAGGTTCTGTCTTTTACACATCCTACCTCGTTCCACTCTGCAGAGACATATGAGTCCTTG 2160

Qy       2161 TTACAGTGCCTAAGAATGGAGGATGACAAGGTAGCAGAAGCTGCTATTCAAATTTTTAGA 2220
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2161 TTACAGTGCCTAAGAATGGAGGATGACAAGGTAGCAGAAGCTGCTATTCAAATTTTTAGA 2220

Qy       2221 AATACAGGTCACAAAATAGAAACAGACCTTCCCCAGATACGATCGACCTTAATTCCCATT 2280
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2221 AATACAGGTCACAAAATAGAAACAGACCTTCCCCAGATACGATCGACCTTAATTCCCATT 2280

Qy       2281 TTACATCAAAAAGCAAAGAGGGGTACTCCACACCAAGCAAAACAGGCTGTGCACTGTATA 2340
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2281 TTACATCAAAAAGCAAAGAGGGGTACTCCACACCAAGCAAAACAGGCTGTGCACTGTATA 2340

Qy       2341 CACGCCATATTCACAAATAAAGAAGTCCAGCTTGCACAGATTTTTGAGCCACTCAGTAGG 2400
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2341 CACGCCATATTCACAAATAAAGAAGTCCAGCTTGCACAGATTTTTGAGCCACTCAGTAGG 2400

Qy       2401 AGTCTGAATGCTGATGTGCCAGAACAACTTATAACTCCATTAGTTTCATTGGGCCACATT 2460
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2401 AGTCTGAATGCTGATGTGCCAGAACAACTTATAACTCCATTAGTTTCATTGGGCCACATT 2460

Qy       2461 TCTATGTTAGCACCAGATCAGTTTGCTTCCCCAATGAAATCTGTAGTAGCAAATTTTATT 2520
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2461 TCTATGTTAGCACCAGATCAGTTTGCTTCCCCAATGAAATCTGTAGTAGCAAATTTTATT 2520

Qy       2521 GTGAAAGATCTGCTAATGAATGACAGGTCAACAGGTGAAAAGAATGGAAAACTGTGGTCT 2580
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2521 GTGAAAGATCTGCTAATGAATGACAGGTCAACAGGTGAAAAGAATGGAAAACTGTGGTCT 2580

Qy       2581 CCAGATGAAGAGGTTTCCCCTGAAGTACTAGCAAAGGTACAGGCAATTAAACTTCTGGTA 2640
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2581 CCAGATGAAGAGGTTTCCCCTGAAGTACTAGCAAAGGTACAGGCAATTAAACTTCTGGTA 2640

Qy       2641 AGGTGGCTGTTGGGTATGAAAAACAACCAGTCTAAATCTGCCAATTCAACCCTTCGGTTA 2700
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2641 AGGTGGCTGTTGGGTATGAAAAACAACCAGTCTAAATCTGCCAATTCAACCCTTCGGTTA 2700

Qy       2701 TTATCAGCGATGTTGGTTAGTGAGGGTGACCTGACAGAGCAAAAGAGGATCAGTAAATCT 2760
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2701 TTATCAGCGATGTTGGTTAGTGAGGGTGACCTGACAGAGCAAAAGAGGATCAGTAAATCT 2760

Qy       2761 GATATGTCTCGCTTGCGATTAGCTGCTGGTAGTGCCATAATGAAGCTTGCTCAGGAACCT 2820
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2761 GATATGTCTCGCTTGCGATTAGCTGCTGGTAGTGCCATAATGAAGCTTGCTCAGGAACCT 2820

Qy       2821 TGTTACCATGAAATTATTACCCCAGAACAGTTTCAGCTCTGTGCACTTGTTATTAATGAT 2880
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2821 TGTTACCATGAAATTATTACCCCAGAACAGTTTCAGCTCTGTGCACTTGTTATTAATGAT 2880

Qy       2881 GAGTGTTACCAAGTAAGGCAGATATTTGCTCAGAAGCTGCATAAGGCACTTGTGAAGTTA 2940
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2881 GAGTGTTACCAAGTAAGGCAGATATTTGCTCAGAAGCTGCATAAGGCACTTGTGAAGTTA 2940

Qy       2941 CTGCTCCCATTGGAGTATATGGCGATCTTTGCCTTGTGTGCCAAAGATCCTGTGAAGGAG 3000
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2941 CTGCTCCCATTGGAGTATATGGCGATCTTTGCCTTGTGTGCCAAAGATCCTGTGAAGGAG 3000

Qy       3001 AGAAGAGCACACGCACGACAATGTTTACTGAAAAATATCAGTATACGCAGGGAATACATT 3060
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       3001 AGAAGAGCACACGCACGACAATGTTTACTGAAAAATATCAGTATACGCAGGGAATACATT 3060

Qy       3061 AAGCAGAATCCTATGGCTACTGAGAAATTATTATCACTGTTGCCTGAATATGTAGTTCCA 3120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       3061 AAGCAGAATCCTATGGCTACTGAGAAATTATTATCACTGTTGCCTGAATATGTAGTTCCA 3120

Qy       3121 TACATGATTCACCTGCTAGCCCATGATCCAGATTTTACAAGATCACAAGATGTTGATCAG 3180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       3121 TACATGATTCACCTGCTAGCCCATGATCCAGATTTTACAAGATCACAAGATGTTGATCAG 3180

Qy       3181 CTTCGTGATATCAAAGAGTGCCTATGGTTCATGCTTGAAGTTTTAATGACAAAGAATGAA 3240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       3181 CTTCGTGATATCAAAGAGTGCCTATGGTTCATGCTTGAAGTTTTAATGACAAAGAATGAA 3240

Qy       3241 AACAATAGCCATGCCTTTATGAAGAAGATGGCAGAGAACATCAAGTTAACCAGAGATGCC 3300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       3241 AACAATAGCCATGCCTTTATGAAGAAGATGGCAGAGAACATCAAGTTAACCAGAGATGCC 3300

Qy       3301 CAGTCTCCAGATGAATCCAAGACAAATGAAAAACTGTATACAGTATGTGATGTGGCTCTC 3360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       3301 CAGTCTCCAGATGAATCCAAGACAAATGAAAAACTGTATACAGTATGTGATGTGGCTCTC 3360

Qy       3361 TGTGTTATAAATAGTAAAAGTGCTTTGTGCAATGCAGATTCACCAAAGGACCCAGTCCTC 3420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       3361 TGTGTTATAAATAGTAAAAGTGCTTTGTGCAATGCAGATTCACCAAAGGACCCAGTCCTC 3420

Qy       3421 CCAATGAAATTTTTTACACAACCTGAAAAGGACTTCTGTAACGATAAGAGTTATATTTCA 3480
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       3421 CCAATGAAATTTTTTACACAACCTGAAAAGGACTTCTGTAACGATAAGAGTTATATTTCA 3480

Qy       3481 GAAGAGACAAGAGTACTTCTGTTAACAGGAAAGCCAAAGCCTGCTGGAGTACTAGGTGCA 3540
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       3481 GAAGAGACAAGAGTACTTCTGTTAACAGGAAAGCCAAAGCCTGCTGGAGTACTAGGTGCA 3540

Qy       3541 GTAAATAAGCCTTTATCAGCAACGGGAAGGAAACCCTATGTTAGAAGCACTGGCACTGAG 3600
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       3541 GTAAATAAGCCTTTATCAGCAACGGGAAGGAAACCCTATGTTAGAAGCACTGGCACTGAG 3600

Qy       3601 ACTGGAAGCAATATTAATGTAAATTCAGAGCTGAACCCTTCAACCGGAAATCGATCAAGG 3660
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       3601 ACTGGAAGCAATATTAATGTAAATTCAGAGCTGAACCCTTCAACCGGAAATCGATCAAGG 3660

Qy       3661 GAACAGAGTTCAGAGGCAGCAGAAACTGGAGTTAGTGAAAATGAAGAGAACCCTGTGAGG 3720
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       3661 GAACAGAGTTCAGAGGCAGCAGAAACTGGAGTTAGTGAAAATGAAGAGAACCCTGTGAGG 3720

Qy       3721 ATTATTTCAGTCACACCTGTAAAGAATATTGACCCAGTAAAGAATAAGGAAATTAATTCT 3780
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       3721 ATTATTTCAGTCACACCTGTAAAGAATATTGACCCAGTAAAGAATAAGGAAATTAATTCT 3780

Qy       3781 GATCAGGCTACCCAGGGCAACATCAGCAGTGACCGAGGAAAGAAAAGAACAGTAACAGCA 3840
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       3781 GATCAGGCTACCCAGGGCAACATCAGCAGTGACCGAGGAAAGAAAAGAACAGTAACAGCA 3840

Qy       3841 GCTGGTGCAGAGAATATCCAACAAAAAACAGATGAGAAAGTAGATGAATCGGGACCTCCC 3900
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       3841 GCTGGTGCAGAGAATATCCAACAAAAAACAGATGAGAAAGTAGATGAATCGGGACCTCCC 3900

Qy       3901 GCCCCTTCCAAACCCAGGAGAGGACGTCGACCCAAGTCTGAATCTCAGGGCAATGCTACC 3960
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       3901 GCCCCTTCCAAACCCAGGAGAGGACGTCGACCCAAGTCTGAATCTCAGGGCAATGCTACC 3960

Qy       3961 AAAAATGATGATCTAAATAAACCTATTAACAAGGGAAGGAAGAGAGCTGCAGTGGGTCAG 4020
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       3961 AAAAATGATGATCTAAATAAACCTATTAACAAGGGAAGGAAGAGAGCTGCAGTGGGTCAG 4020

Qy       4021 GAGAGCCCTGGGGGTTTGGAAGCAGGTAATGCCAAAGCACCCAAACTGCAAGATTTAGCC 4080
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       4021 GAGAGCCCTGGGGGTTTGGAAGCAGGTAATGCCAAAGCACCCAAACTGCAAGATTTAGCC 4080

Qy       4081 AAAAAGGCAGCACCAGCAGAAAGACAAATTGACTTACAAAGGTAAAAATGCATTTGCAAA 4140
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       4081 AAAAAGGCAGCACCAGCAGAAAGACAAATTGACTTACAAAGGTAAAAATGCATTTGCAAA 4140

Qy       4141 GGGAGAAAATGAAGGCCAAACAGAAGCAGGCTCCAGCTTCTGCAAAAACTTGGATTCACA 4200
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       4141 GGGAGAAAATGAAGGCCAAACAGAAGCAGGCTCCAGCTTCTGCAAAAACTTGGATTCACA 4200

Qy       4201 AATGTCCCTGAACAGAAAATGAAGCTCACTTCAGAACACACACTCTCTGCCTTGAAAACT 4260
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       4201 AATGTCCCTGAACAGAAAATGAAGCTCACTTCAGAACACACACTCTCTGCCTTGAAAACT 4260

Qy       4261 AAAGAGACTATTACTTCCTTTTCACATGACCACAAGTCCTCTGATGGAAATGTACAGCAG 4320
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       4261 AAAGAGACTATTACTTCCTTTTCACATGACCACAAGTCCTCTGATGGAAATGTACAGCAG 4320

Qy       4321 AAACTCTTGAGAGAGAGGCTAAAAGCAACTCTGTTCTCCCCCTTCCCCTAGACTTTTCTT 4380
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       4321 AAACTCTTGAGAGAGAGGCTAAAAGCAACTCTGTTCTCCCCCTTCCCCTAGACTTTTCTT 4380

Qy       4381 ACGAAAAGTCAATAATTAAGCAAATTGCTTAACACTTGGTTCCAGTTCCTGCCTATCTGG 4440
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       4381 ACGAAAAGTCAATAATTAAGCAAATTGCTTAACACTTGGTTCCAGTTCCTGCCTATCTGG 4440

Qy       4441 AGTTTAAATGCGTAATACACCATTAATTTCCACGCTGCAGTTTTTATTTTAAAGAAAGTA 4500
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       4441 AGTTTAAATGCGTAATACACCATTAATTTCCACGCTGCAGTTTTTATTTTAAAGAAAGTA 4500

Qy       4501 ACAAGATGTCTTTACACTGACACTGAAAATTCATCCATTTTAGAGCCAGGAATTCCCATG 4560
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       4501 ACAAGATGTCTTTACACTGACACTGAAAATTCATCCATTTTAGAGCCAGGAATTCCCATG 4560

Qy       4561 TTACACAGGAAAAAATAGAAGTCTACTGAATTAATTTTTTAAAAGAAAAGAGATCAGATT 4620
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       4561 TTACACAGGAAAAAATAGAAGTCTACTGAATTAATTTTTTAAAAGAAAAGAGATCAGATT 4620

Qy       4621 AAATATTTCTTTGTTTTTCCTTTTGGAAACTTTTATGTATAATTCTTTCTGCCTGCCTAC 4680
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       4621 AAATATTTCTTTGTTTTTCCTTTTGGAAACTTTTATGTATAATTCTTTCTGCCTGCCTAC 4680

Qy       4681 TTTTCTGCAAAAATGAGATGTACAGATTTCGGTTCCCTGCTATGAAAAGTGATGTGGTAG 4740
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       4681 TTTTCTGCAAAAATGAGATGTACAGATTTCGGTTCCCTGCTATGAAAAGTGATGTGGTAG 4740

Qy       4741 CAATTTTATAAATGTTGCTTTCTGATTTTTATCAGAGTGAGAAAATTAAAATTATTGATT 4800
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       4741 CAATTTTATAAATGTTGCTTTCTGATTTTTATCAGAGTGAGAAAATTAAAATTATTGATT 4800

Qy       4801 TGCAAGTAGTAAACAGTTCATATTTTGATTTCCCCTCATTTTAGTTTAATATAATTTGCA 4860
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       4801 TGCAAGTAGTAAACAGTTCATATTTTGATTTCCCCTCATTTTAGTTTAATATAATTTGCA 4860

Qy       4861 ATAAATGTACATATTGTTGTTTGTTTCATAAAGCATATCACTTTAAAATGGTTTTTACTC 4920
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       4861 ATAAATGTACATATTGTTGTTTGTTTCATAAAGCATATCACTTTAAAATGGTTTTTACTC 4920

Qy       4921 CTGTGATTATGTTGGAATATTTGGAATTTTAAAGGAGTAAAGACTGTCCAGCATTTGGTT 4980
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       4921 CTGTGATTATGTTGGAATATTTGGAATTTTAAAGGAGTAAAGACTGTCCAGCATTTGGTT 4980

Qy       4981 TTATAATGTTTGTCACCAGATTTTTATTAATGTAAAAAATCAATTTTTAAAAAATAGTTG 5040
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       4981 TTATAATGTTTGTCACCAGATTTTTATTAATGTAAAAAATCAATTTTTAAAAAATAGTTG 5040

Qy       5041 GACTTTGGCAGCTTTTAAGGAAAGTTGGAGGTGTTTTAGGATTGCTATCAATTTTCAGCA 5100
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       5041 GACTTTGGCAGCTTTTAAGGAAAGTTGGAGGTGTTTTAGGATTGCTATCAATTTTCAGCA 5100

Qy       5101 TTGTGCTATTTGGAAATAAGTGTTTTGCTTTTGTCTGATGGTCTGGGCTCATTTTTATGT 5160
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       5101 TTGTGCTATTTGGAAATAAGTGTTTTGCTTTTGTCTGATGGTCTGGGCTCATTTTTATGT 5160

Qy       5161 TTATTTTAGAAAACTGTTGCATCAATATATTATGTTTCTTGGCATTGTTCAGCATAGGTA 5220
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       5161 TTATTTTAGAAAACTGTTGCATCAATATATTATGTTTCTTGGCATTGTTCAGCATAGGTA 5220

Qy       5221 ATGTGTGCACTTTATGTGTACACATAATCATATTTAAGTTTTTTGCATAAAATAAATGCT 5280
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       5221 ATGTGTGCACTTTATGTGTACACATAATCATATTTAAGTTTTTTGCATAAAATAAATGCT 5280

Qy       5281 TCTAGATGTCATGGCAGTCTTTTTAATCTTTTTATCATATGCTTTCTTGTGAATTTTTTC 5340
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       5281 TCTAGATGTCATGGCAGTCTTTTTAATCTTTTTATCATATGCTTTCTTGTGAATTTTTTC 5340

Qy       5341 ATGTTAAAGAGCTAAAGTCATAACATGATTACAGTCAACTCTCCATTATCTATATAAAAT 5400
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       5341 ATGTTAAAGAGCTAAAGTCATAACATGATTACAGTCAACTCTCCATTATCTATATAAAAT 5400

Qy       5401 AGTGACTAAGCCTCAGGTTTTTAATTTTGTGATAACAAAATAACGAAGGCATGTAAGACC 5460
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       5401 AGTGACTAAGCCTCAGGTTTTTAATTTTGTGATAACAAAATAACGAAGGCATGTAAGACC 5460

Qy       5461 TGATTCTGGAGGAACATGAAATTTGTCTTTTCTCATGTCCAGAGTTCTATCCTGCCCCCA 5520
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       5461 TGATTCTGGAGGAACATGAAATTTGTCTTTTCTCATGTCCAGAGTTCTATCCTGCCCCCA 5520

Qy       5521 CTGTCCACTGTAGGGTCATCCGCAAAGCCCTAGCAGAATGTGCTCACTCCATTTCCTTAC 5580
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       5521 CTGTCCACTGTAGGGTCATCCGCAAAGCCCTAGCAGAATGTGCTCACTCCATTTCCTTAC 5580

Qy       5581 ACGTTTCTAGCATGGGTCAGAGGAAACAACATTTGTGTTATAACTTCGTCTTGATAGGCT 5640
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       5581 ACGTTTCTAGCATGGGTCAGAGGAAACAACATTTGTGTTATAACTTCGTCTTGATAGGCT 5640

Qy       5641 GTAGTGTACATGGGATGTAAAACAAACAAGTGTATCAAAGGTGGATGATTCTGTTAGAGT 5700
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       5641 GTAGTGTACATGGGATGTAAAACAAACAAGTGTATCAAAGGTGGATGATTCTGTTAGAGT 5700

Qy       5701 GAAGTTTGAGAGTAAATGTCACTTACGTTTCTCATAGATGATCAAGAGTTGGCTGTGTAT 5760
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       5701 GAAGTTTGAGAGTAAATGTCACTTACGTTTCTCATAGATGATCAAGAGTTGGCTGTGTAT 5760

Qy       5761 TGACTGAAAGATGGGTAATTATTTTAAATATGCATTTACACACATTTAGGTATCAGAAGA 5820
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       5761 TGACTGAAAGATGGGTAATTATTTTAAATATGCATTTACACACATTTAGGTATCAGAAGA 5820

Qy       5821 TGCTTAGGGAACAATGGATACCAATGATAGAAAATGATACCTTTACAGGGGCAGAAAAAT 5880
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       5821 TGCTTAGGGAACAATGGATACCAATGATAGAAAATGATACCTTTACAGGGGCAGAAAAAT 5880

Qy       5881 CCCCACTCTTCCTTATTGCCTCTTCAGAACCCTTTAGAAAGTATAAAATATTGCCTCCAA 5940
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       5881 CCCCACTCTTCCTTATTGCCTCTTCAGAACCCTTTAGAAAGTATAAAATATTGCCTCCAA 5940

Qy       5941 CATGCTGAAAAAGAGTATCTATGCATAAGTATCAGAGAAGTCCCTCAAGCAATCAGTAGG 6000
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       5941 CATGCTGAAAAAGAGTATCTATGCATAAGTATCAGAGAAGTCCCTCAAGCAATCAGTAGG 6000

Qy       6001 TGTGTTCTATTTAGAGAGAGTTTAAAGTTCTCTTAGCATCAGACAACTTGATTCCTAAGG 6060
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       6001 TGTGTTCTATTTAGAGAGAGTTTAAAGTTCTCTTAGCATCAGACAACTTGATTCCTAAGG 6060

Qy       6061 TTTCCAGTGTGTCACCAACAAAAAGTGCATTGATAGGGACCTTTGTCTCTTCCTCCCTTT 6120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       6061 TTTCCAGTGTGTCACCAACAAAAAGTGCATTGATAGGGACCTTTGTCTCTTCCTCCCTTT 6120

Qy       6121 GATTAATTGCCCGGCATCACAGTTTACTAGATTACCAAGTGTTACATCATATTAAATAAA 6180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       6121 GATTAATTGCCCGGCATCACAGTTTACTAGATTACCAAGTGTTACATCATATTAAATAAA 6180

Qy       6181 ATGTAGCAGAACCATCTGCATCAATATATTCCTGTTTAGATTTTTGCAGGAGAGAAGTTA 6240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       6181 ATGTAGCAGAACCATCTGCATCAATATATTCCTGTTTAGATTTTTGCAGGAGAGAAGTTA 6240

Qy       6241 AAAGGATTTGCTCCTTGTATGATGTAAGTGGCCCACCCCAATTTTGTAACATGATGCAAG 6300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       6241 AAAGGATTTGCTCCTTGTATGATGTAAGTGGCCCACCCCAATTTTGTAACATGATGCAAG 6300

Qy       6301 TGTCTGGCACTAAGGGAAGCAAGAGTAGGGTTGTGGAAAGACCAAGCTGATGGGGAGGGA 6360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       6301 TGTCTGGCACTAAGGGAAGCAAGAGTAGGGTTGTGGAAAGACCAAGCTGATGGGGAGGGA 6360

Qy       6361 CTTGTTTACGGGAATTTTTTTAGTTTTCCTTTTCAAAGGAAAACATTAAAATCCCTTAGG 6420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       6361 CTTGTTTACGGGAATTTTTTTAGTTTTCCTTTTCAAAGGAAAACATTAAAATCCCTTAGG 6420

Qy       6421 AATTTGGTATTCACATCTCAGAGAACTACAACACAAAAGTGCAGACTTATATTTGAGAAT 6480
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       6421 AATTTGGTATTCACATCTCAGAGAACTACAACACAAAAGTGCAGACTTATATTTGAGAAT 6480

Qy       6481 TAATGTTAACCCTTTGTGTCTAGTTTGAAGCTTCTTGTATTTGTCTAAAACAACAAGCCA 6540
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       6481 TAATGTTAACCCTTTGTGTCTAGTTTGAAGCTTCTTGTATTTGTCTAAAACAACAAGCCA 6540

Qy       6541 GAATTTTGTATCTCCTTTGATAAAAAGTGTGTATAATGTAAAGTAGTTTTGCATATTCTT 6600
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       6541 GAATTTTGTATCTCCTTTGATAAAAAGTGTGTATAATGTAAAGTAGTTTTGCATATTCTT 6600

Qy       6601 GTGCTGCACATGGGCTGAATTTTTAAATTTTTTTTAAAAACTTGAAGCAGAACCTTGTAA 6660
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       6601 GTGCTGCACATGGGCTGAATTTTTAAATTTTTTTTAAAAACTTGAAGCAGAACCTTGTAA 6660

Qy       6661 TTTGTGTAAATGACAAGTGTAAAATCCTACCATAAAATGCTAAAAATATGCACTGTTTCA 6720
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       6661 TTTGTGTAAATGACAAGTGTAAAATCCTACCATAAAATGCTAAAAATATGCACTGTTTCA 6720

Qy       6721 AATAAAACCAAGAAATGCAGCATT 6744
              ||||||||||||||||||||||||
Db       6721 AATAAAACCAAGAAATGCAGCATT 6744
PF 29-JUL-2004; 2004WO-US024424.
XX 

PR 29-JUL-2003; 2003US-0490890P.
XX 

PA (BRIM) BRISTOL-MYERS SQUIBB CO.
XX

PI Li M, Rupnow BA, Webster KR, Jackson DG, Wong TW;
XX 

DR WPI; 2005-163068/17. 

DR P-PSDB; ADX06263. 
XX 

PT Biomarkers useful for predicting or determining the response of a mammal 

PT to a cancer treatment comprising administration of a modulator of cyclin- 

PT dependent kinase activity. 
XX 

PS Claim 5; SEQ ID NO 827; 141pp; English. 
XX 

CC This invention describes a novel method of predicting or determining 

CC whether a mammal will respond or is responding to an anti-cancer agent

CC that modulates cyclin-dependent kinase (cdk) activity. The method 

CC comprises measuring the level of one or more biomarkers selected from 

CC 2774 biomarkers given in the specification (nucleotide sequence SEQ ID 

CC NO:1246 (Genbank EST W28729) is especially preferred). The method of the

CC invention is utilized in a kit for determining or predicting whether 

CC patient would be susceptible or resistant to treatment by an agent 

CC modulating cdk activity. The invention also describes a method for 

CC utilizing individualized genetic profiles for treating diseases and 

CC disorders based on patient's response and molecular level, specialized 

CC microarrays comprising the biomarkers described, antibodies directed 

CC against the biomarkers and a cell culture model to identify biomarkers. 

CC The cdk modulator is preferably N-5-[[5-(1,1-Dimethylethyl)-2-

CC oxazolyl]methyl]thio]-2-thiazolyl-4-piperidinecarboxamide, 0.5-L-

CC tartaric acid salt. Note: The sequence data for this patent did not form

CC part of the printed specification, but was obtained in electronic format 

CC directly from WIPO at ftp.wipo.int/pub/published_pct_sequences. This 

CC sequence encodes a biomarker used in the method of the invention. 

XX 

SQ Sequence 6744 BP; 2156 A; 1243 C; 1400 G; 1945 T; 0 U; 0 Other; 
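
The SQ line above reports the sequence length together with per-base counts, so the counts can be checked against the length. The following is a minimal Python sketch of that check; the function name `parse_sq_line` is hypothetical and the parsing pattern is an assumption based only on the layout of the line shown here, not on any GenCore or database specification.

```python
import re


def parse_sq_line(line: str) -> dict[str, int]:
    """Parse an 'SQ' summary line into {label: count}.

    Assumes every field has the form '<number> <label>;' as in the line above.
    """
    return {label: int(n) for n, label in re.findall(r"(\d+)\s+([A-Za-z]+);", line)}


sq_line = "SQ Sequence 6744 BP; 2156 A; 1243 C; 1400 G; 1945 T; 0 U; 0 Other;"
counts = parse_sq_line(sq_line)
length = counts.pop("BP")           # total length in base pairs
composition_total = sum(counts.values())

# The per-base counts should add up to the reported length.
print(length, composition_total, length == composition_total)
```

For this entry the base counts sum to the reported 6744 bp, which also equals the query length and the perfect score.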



Query Match 100.0%; Score 6744; DB 14; Length 6744; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 6744; Conservative 0; Mismatches 0; Indels 0; Gaps 0;



Qy          1 GGACCTCGCAGGCCAAGAATTCGGCACGAGGGGCGCCGGCTCCCGGGGCACGGACGGCCG 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 GGACCTCGCAGGCCAAGAATTCGGCACGAGGGGCGCCGGCTCCCGGGGCACGGACGGCCG 60

Qy         61 GGCGCGCGCCTCTGCGAGGGGCGTCCGGGTCCGAGTCGGCGGTCCGGGCCGGCGCGAGGT 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 GGCGCGCGCCTCTGCGAGGGGCGTCCGGGTCCGAGTCGGCGGTCCGGGCCGGCGCGAGGT 120

Qy        121 GCGTGCGGGCGGGCCGCGGGGGTCCCGGACGGACACAAGCGCACACACTCCCGGAAGATC 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 GCGTGCGGGCGGGCCGCGGGGGTCCCGGACGGACACAAGCGCACACACTCCCGGAAGATC 180

Qy        181 GCTTACCCTCCGGGGGTAAAAGAGATCACCGACAAGATCACCACGGACGAGATGATCAAA 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 GCTTACCCTCCGGGGGTAAAAGAGATCACCGACAAGATCACCACGGACGAGATGATCAAA 240

Qy        241 CGCCTGAAGATGGTAGTGAAAACCTTTATGGATATGGATCAGGACTCAGAAGATGAAAAA 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

This page gives you Search Results detail for the Application 10679580 and Search Result us-10-679-580a-1.rst.

GenCore version 5.1.9
Copyright (c) 1993 - 2006 Biocceleration Ltd.

OM nucleic - nucleic search, using sw model

Run on: July 24, 2006, 02:05:21 ; Search time 28579 Seconds
(without alignments)
13195.702 Million cell updates/sec

Title:          US-10-679-580A-1
Perfect score:  6744
Sequence:       1 ggacctcgcaggccaagaat aaaccaagaaatgcagcatt 6744

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       48236798 seqs, 27959665780 residues

Total number of hits satisfying chosen parameters: 96473596

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries

Database :      EST:*
                 1:  gb_est1:*
                 2:  gb_est3:*
                 3:  gb_est4:*
                 4:  gb_est5:*
                 5:  gb_est6:*
                 6:  gb_htc:*
                 7:  gb_est2:*
                 8:  gb_est7:*
                 9:  gb_est8:*
                10:  gb_est9:*
                11:  gb_gss1:*
                12:  gb_gss2:*
                13:  gb_gss3:*
                14:  gb_gss4:*

Pred. No. is the number of results predicted by chance to have a
score greater than or equal to the score of the result being printed,
and is derived by analysis of the total score distribution.
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SUMMARIES 

                  %
Result          Query
  No.   Score   Match  Length  DB  ID        Description

    1  5070.6    75.2    5134   6  CR749425  CR749425 Homo sapi
    2  3628.2    53.8    4019  14  DQ049835  DQ049835 Homo sapi
    3  3385.8    50.2    4019  14  DQ049836  DQ049836 Pan trogl
    4   932.4    13.8    2899   6  AK041682  AK041682 Mus muscu
    5   865.2    12.8     939   3  BQ959509  BQ959509 AGENCOURT
    6   852.8    12.6    1182   2  BM454791  BM454791 AGENCOURT
    7   850.8    12.6     909   3  BU195798  BU195798 AGENCOURT
    8   848.2    12.6    1088   2  BM475383  BM475383 AGENCOURT
    9   845.8    12.5     923   3  BQ438312  BQ438312 AGENCOURT
   10   841.4    12.5     931  10  DV928225  DV928225 LB03012.C
   11   833.8    12.4     922   8  CR986733  CR986733 CR986733
   12   832.2    12.3    1106   3  BM799673  BM799673 AGENCOURT
   13   827.8    12.3     874   3  BU517176  BU517176 AGENCOURT
   14   826.2    12.3     859   8  CR999562  CR999562 CR999562
   15   819.8    12.2     991   2  BM470743  BM470743 AGENCOURT
   16   817.4    12.1     867   3  BU159962  BU159962 AGENCOURT
   17   814.6    12.1    1142   2  BM466282  BM466282 AGENCOURT
   18   810      12.0    1002   2  BM465904  BM465904 AGENCOURT
   19   808.8    12.0     835   9  DA735319  DA735319 DA735319
   20   800.4    11.9     856   3  BU166770  BU166770 AGENCOURT
   21   791.4    11.7    1056   2  BM477531  BM477531 AGENCOURT
   22   782.6    11.6    3313   6  AK045159  AK045159 Mus muscu
   23   776.4    11.5     792   8  CN430328  CN430328 170005322
   24   770.8    11.4     853   3  BQ440349  BQ440349 AGENCOURT
   25   767.4    11.4    1092   2  BM465566  BM465566 AGENCOURT
   26   761      11.3     764   9  DA736985  DA736985 DA736985
   27   755      11.2     766   8  CN430329  CN430329 170005319
   28   754.2    11.2    3378   6  AK045737  AK045737 Mus muscu
   29   749      11.1     773   1  AL705029  AL705029 DKFZp686B
   30   748.8    11.1     979   2  BG758965  BG758965 602713079
   31   747.8    11.1     786   2  BG208101  BG208101 RST27591
   32   745      11.0     864   8  CV884014  CV884014 UI-M-HZ0-
   33   743.8    11.0     780   3  BU630464  BU630464 UI-H-FL0-
   34   741      11.0     796   1  AU137771  AU137771 AU137771
   35   731.6    10.8    1047   8  CO647378  CO647378 ILLUMIGEN
   36   729.6    10.8     766   3  BU631648  BU631648 UI-H-FL0-
   37   729      10.8     761   8  CN430334  CN430334 170006001
   38   728      10.8     825   2  BI254793  BI254793 602974029
   39   726.8    10.8     785   1  AU137328  AU137328 AU137328
   40   720.8    10.7     745   3  BU631677  BU631677 UI-H-FL0-
   41   720.4    10.7     752   8  CN430332  CN430332 170004248
   42   719      10.7     719   9  CX868031  CX868031 HESC4_25_
   43   718      10.6     878   3  BM801605  BM801605 AGENCOURT
   44   715.2    10.6     738   1  AL045878  AL045878 DKFZp434J
   45   712.4    10.6     823   3  BU703952  BU703952 UI-M-FO0-



ALIGNMENTS 



RESULT 1 
CR749425
LOCUS       CR749425                5134 bp    mRNA    linear   HTC 17-APR-2005
DEFINITION  Homo sapiens mRNA; cDNA DKFZp686B19246 (from clone DKFZp686B19246).
ACCESSION   CR749425
VERSION     CR749425.1  GI:51476560
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KEYWORDS HTC . 

SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini;
            Hominidae; Homo.
REFERENCE   1  (bases 1 to 5134)
  AUTHORS   Koehrer,K., Beyer,A., Mewes,H.W., Weil,B., Amid,C., Osanger,A.,
            Fobo,G., Han,M. and Wiemann,S.
  CONSRTM   The German cDNA Consortium
  TITLE     Direct Submission
  JOURNAL   Submitted (17-AUG-2004) MIPS, Ingolstaedter Landstr.1, D-85764
            Neuherberg, GERMANY
COMMENT     Clone from S. Wiemann, Molecular Genome Analysis, German Cancer
            Research Center (DKFZ); Email s.wiemann@dkfz-heidelberg.de;
            sequenced by BMFZ (Biomedical Research Center at the
            Heinrich-Heine-University, Duesseldorf/Germany) within the cDNA
            sequencing consortium of the German Genome Project. This clone
            (DKFZp686B19246) is available at the RZPD Deutsches
            Ressourcenzentrum fuer Genomforschung GmbH in Berlin, Germany.
            Please contact RZPD for ordering:
            http://www.rzpd.de/cgi-bin/products/cl.cgi?CloneID=DKFZp686B19246
            Further information about the clone and the sequencing project is
            available at http://mips.gsf.de/projects/cdna/.
FEATURES Location/Qualifiers 
     source          1..5134
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="DKFZp686B19246"
                     /tissue_type="salivary gland"
                     /clone_lib="686 (synonym: hlcc3). Vector pSport1_Sfi; host
                     DH10B; sites SfiIA + SfiIB"
                     /dev_stage="adult"
                     /note="hypothetical protein"
     gene            1..5134
                     /gene="DKFZp686B19246"
     CDS             30..3923
                     /gene="DKFZp686B19246"
                     /codon_start=1
                     /product="hypothetical protein"
                     /protein_id="CAH18263.1"
                     /db_xref="GI:51476561"
                     /db_xref="GOA:Q68DF7"
                     /db_xref="InterPro:IPR000357"
                     /db_xref="UniProtKB/TrEMBL:Q68DF7"

                     /translation="MIKRLKMVVKTFMDMDQDSEDEKQQYLPLALHLASEFFLRNPNK
                     DVRLLVACCLADIFRIYAPEAPYTSHDKLKDIFLFITRQLKGLEDTKSPQFNRYFYLL
                     ENLAWVKSYNICFELEDCNEIFIQLFRTLFSVINNSHNKKVQMHMLDLMSSIIMEGDG
                     VTQELLDSILINLIPAHKNLNKQSFDLAKVLLKRTVQTIEACIANFFNQVLVLGRSSV
                     SDLSEHVFDLIQELFAIDPHLLLSVMPQLEFKLKSNDGEERLAVVRLLAKLFGSKDSD
                     LATQNRPLWQCFLGRFNDIHVPVRLESVKFASHCLMNHPDLAKDLTEYLKVRSHDPEE
                     AIRHDVIVTIITAAKRDLALVNDQLLGFVRERTLDKRWRVRKEAMMGLAQLYKKYCLH
                     GEAGKEAAEKVSWIKDKLLHIYYQNSIDDKLLVEKIFAQYLVPHNLETEERMKCLYYL
                     YASLDPNAVKALNEMWKCQNMLRSHVRELLDLHKQPTSEANCSAMFGKLMTIAKNLPD
                     PGKAQDFVKKFNQVLGDDEKLRSQLELLISPTCSCKQADICVREIARKLANPKQPTNP
                     FLEMVKFLLERIAPVHIDSEAISALVXLMNKSIEGTADDEEEGVSPDTAIRSGLELLK
                     VLSFTHPTSFHSAETYESLLQCLRMEDDKVAEAAIQIFRNTGHKIETDLPQIRSTLIP
                     ILHQKAKRGTPHQAKQAVHCIHAIFTNKEVQLAQIFEPLSRSLNADVPEQLITPLVSL
                     GHISMLAPDQFASPMKSVVANFIVKDLLMNDRSTGEKNGKLWSPDEEVSPEVLAKVQA
                     IKLLVRWLLGMKNNQSKSANSTLRLLSAMLVSEGDLTEQKRISKSDMSRLRLAAGSAI
                     MKLAQEPCYHEIITPEQFQLCALVINDECYQVRQIFAQKLHKALVKLLLPLEYMAIFA
                     LCAKDPVKERRAHARQCLLKNISIRREYIKQNPMATEKLLSLLPEYVVPYMIHLLAHD
                     PDFTRSQDVDQLRDIKECLWFMLEVLMTKNENNSHAFMKKMAENIKLTRDAQSPDESK
                     TNEKLYTVCDVALCVINSKSALCNADSPKDPVLPMKFFTQPEKDFCNDKSYISEETRV
                     LLLTGKPKPAGVLGAVNKPLSATGRKPYVRSTGTETGSNINVNSELNPSTGNRSREQS
                     SEAAETGVSENEENPVRIISVTPVKNIDPVKNKEINSDQATQGNISSDRGKKRTVTAA
                     GAENIQQKTDEKVDESGPPAPSKPRRGRRPKSESQGNATKNDDLNKPINKGRKRAAVG
                     QESPGGLEAGNAKAPKLQDLAKKAAPAERQIDLQR"



ORIGIN 



  Query Match             75.2%;  Score 5070.6;  DB 6;  Length 5134;
  Best Local Similarity   99.9%;  Pred. No. 0;
  Matches 5085;  Conservative    0;  Mismatches    4;  Indels    2;  Gaps    1;

Qy        203 AGATCACCGACAAGATCACCACGGACGAGATGATCAAACGCCTGAAGATGGTAGTGAAAA 262
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 AGATCACCGACAAGATCACCACGGACGAGATGATCAAACGCCTGAAGATGGTAGTGAAAA 60

Qy        263 CCTTTATGGATATGGATCAGGACTCAGAAGATGAAAAACAGCAGTATCTCCCACTAGCCT 322
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 CCTTTATGGATATGGATCAGGACTCAGAAGATGAAAAACAGCAGTATCTCCCACTAGCCT 120

Qy        323 TGCATCTTGCATCTGAATTCTTCCTCAGGAACCCCAATAAAGATGTGCGTCTCCTTGTAG 382
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 TGCATCTTGCATCTGAATTCTTCCTCAGGAACCCCAATAAAGATGTGCGTCTCCTTGTAG 180

Qy        383 CATGTTGTTTGGCTGATATCTTTCGTATCTATGCCCCAGAAGCTCCATATACTTCCCATG 442
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 CATGTTGTTTGGCTGATATCTTTCGTATCTATGCCCCAGAAGCTCCATATACTTCCCATG 240

Qy        443 ATAAACTTAAGGACATATTTTTGTTTATTACCAGACAATTAAAAGGTTTGGAGGATACAA 502
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        241 ATAAACTTAAGGACATATTTTTGTTTATTACCAGACAATTAAAAGGTTTGGAGGATACAA 300

Qy        503 AGAGTCCACAGTTTAATAGATACTTTTATTTATTAGAGAATTTAGCTTGGGTTAAATCAT 562
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 AGAGTCCACAGTTTAATAGATACTTTTATTTATTAGAGAATTTAGCTTGGGTTAAATCAT 360

Qy        563 ATAACATCTGCTTTGAATTGGAAGATTGCAATGAAATTTTTATTCAGCTTTTTAGAACTC 622
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        361 ATAACATCTGCTTTGAATTGGAAGATTGCAATGAAATTTTTATTCAGCTTTTTAGAACTC 420

Qy        623 TCTTCTCAGTGATCAACAATAGCCACAATAAGAAGGTACAAATGCACATGCTAGATTTGA 682
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        421 TCTTCTCAGTGATCAACAATAGCCACAATAAGAAGGTACAAATGCACATGCTAGATTTGA 480

Qy        683 TGAGTTCTATCATCATGGAAGGTGATGGAGTTACTCAAGAATTATTGGGCTCCATTCTTA 742
              |||||||||||||||||||||||||||||||||||||||||||||||| |||||||||||
Db        481 TGAGTTCTATCATCATGGAAGGTGATGGAGTTACTCAAGAATTATTGGACTCCATTCTTA 540

Qy        743 TTAACCTCATTCCTGCACATAAGAACTTAAATAAACAGTCCTTTGACCTTGCAAAAGTGC 802
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        541 TTAACCTCATTCCTGCACATAAGAACTTAAATAAACAGTCCTTTGACCTTGCAAAAGTGC 600

Qy        803 TATTGAAAAGAACAGTCCAGACTATTGAGGCATGCATTGCCAATTTTTTCAATCAAGTCC 862
              |||||||||||||||||||||||||||||||||||||||| |||||||||||||||||||
Db        601 TATTGAAAAGAACAGTCCAGACTATTGAGGCATGCATTGCTAATTTTTTCAATCAAGTCC 660

Qy        863 TGGTGCTGGGAAGATCATCAGTAAGTGATTTGTCAGAACATGTATTTGATCTGATTCAGG 922
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        661 TGGTGCTGGGAAGATCATCAGTAAGTGATTTGTCAGAACATGTATTTGATCTGATTCAGG 720
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SCORE Search Results Details for Application 10679580 
and Search Result us-10-679-580a-2.rag. 


GenCore version 5.1.9
Copyright (c) 1993 - 2006 Biocceleration Ltd.

OM protein - protein search, using sw model

Run on:         July 21, 2006, 21:02:13 ; Search time 203 Seconds
                (without alignments)
                2921.230 Million cell updates/sec

Title:          US-10-679-580A-2
Perfect score:  6650
Sequence:       1 MIKRLKMVVKTFMDMDQDSE...KLQDLAKKAAPAERQIDLQR 1297

Scoring table:  BLOSUM62
                Gapop 10.0 , Gapext 0.5

Searched:       2589679 seqs, 457216429 residues

Total number of hits satisfying chosen parameters:      2589679

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries



Database :      A_Geneseq_8:*
                1:  geneseqp1980s:*
                2:  geneseqp1990s:*
                3:  geneseqp2000s:*
                4:  geneseqp2001s:*
                5:  geneseqp2002s:*
                6:  geneseqp2003as:*
                7:  geneseqp2003bs:*
                8:  geneseqp2004s:*
                9:  geneseqp2005s:*
                10: geneseqp2006s:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



                     %
Result           Query
   No.    Score  Match Length  DB  ID        Description
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Abr58297 
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6618 
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.5 
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6 


ABG72803 


Abg72803 


Human turn 


     4     4711   70.8    919   5  ABP64685  Abp64685 Human pro
     5   4668.5   70.2   1391   3  AAY94702  Aay94702 Human and


6 


4363 


65 


. 6 


854 


3 


AAB01382 
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Neuron- as 


7 


3494 


52 


. 5 
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6 


ABR58332 
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XM_047529 


8 
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42 


.7 


600 


7 


ADM06198 


Adm06198 


Human pro 


     9     2842   42.7    600   9  AEC89128  Aec89128 Human cDN
    10   2439.5   36.7    846   7  ADJ69898  Adj69898 Human hea
    11     1911   28.7   1218   4  ABB67869  Abb67869 Drosophil
    12     1861   28.0    363   3  AAB58462  Aab58462 Lung canc
    13     1715   25.8    333   4  AAB94946  Aab94946 Human pro
    14   1440.5   21.7    588   8  ADR10188  Adr10188 Human pro


15 


1187 


17 


.8 


233 


4 
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Aab9544 9 


Human pro 


    16      910   13.7    182   5  ABB84560  Abb84560 Human euc


17 
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13 


.0 


165 


4 


AAU0175 9 


Aau01759 


Human sec 


18 
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11 


.4 


161 


4 


AAU0176 0 


Aau01760 


Human sec 


    19      387    5.8    147   5  ABB89260  Abb89260 Human pol
    20      377    5.7     92   3  AAG02811  Aag02811 Human sec
    21      314    4.7    101   3  AAY65247  Aay65247 Human 5'
    22      314    4.7    101   8  ADU72811  Adu72811 Signal pe
    23      314    4.7    101   9  ADZ73802  Adz73802 Human inc
    24    217.5    3.3   1743   2  AAW98879  Aaw98879 H. pylori
    25    214.5    3.2   1875   6  ABR53560  Abr53560 Protein s
    26    214.5    3.2   1875   7  ADK64380  Adk64380 Disease t


27 
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3 


.2 


1875 


8 


ADS43855 
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.2 
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8 
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Disease t 
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.1 - 
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8 
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Bacterial 


32 
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. 1 
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9 


ADY85285 


Ady85285 


Yeast horn 


    33      204    3.1    759   7  ABM88120  Abm88120 Rice abio
    34    199.5    3.0    960   4  AAB94466  Aab94466 Human pro
    35    199.5    3.0   1015   4  AAG64040  Aag64040 Human DRF
    36    199.5    3.0   1015   9  ADY20095  Ady20095 PRO polyp
    37    199.5    3.0   1015   9  ADY17460  Ady17460 PRO polyp
    38    199.5    3.0   1160   4  ABB59418  Abb59418 Drosophil
    39      199    3.0   2568   9  ADW44253  Adw44253 Human cen
    40      199    3.0   2633   4  ABG06505  Abg06505 Novel hum
    41      199    3.0   2663   4  AAM39097  Aam39097 Human pol
    42      199    3.0   2663   8  ADQ17932  Adq17932 Human sof
    43      199    3.0   2663   9  ADX06867  Adx06867 Cyclin-de
    44      199    3.0   2688   4  AAM40883  Aam40883 Human pol
    45    198.5    3.0   1927   6  ABU30805  Abu30805 Protein e



ALIGNMENTS 



RESULT 1 
ADX06263 

ID ADX06263 standard; protein; 1297 AA . 
XX 

AC ADX06263;
XX 

DT 21-APR-2005 (first entry) 
XX 

DE Cyclin-dependent kinase modulation biomarker SEQ ID NO 828.
XX 
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KW cytostatic; cycl in -dependent kinase; cdk; biomarker. 
XX 

OS Homo sapiens . 
XX 

PN WO2005012875-A2 . 
XX 

PD 10-FEB-2005. 
XX 

PF 29-JUL-2004; 2004WO-US024424.
XX

PR 29-JUL-2003; 2003US-0490890P.
XX

PA (BRIM) BRISTOL-MYERS SQUIBB CO.
XX 

PI Li M, Rupnow BA, Webster KR, Jackson DG, Wong TW; 
XX 

DR WPI; 2005-163068/17. 

DR N-PSDB; ADX06262.
XX 

PT Biomarkers useful for predicting or determining the response of a mammal 

PT to a cancer treatment comprising administration of a modulator of cyclin- 

PT dependent kinase activity. 
XX 

PS Claim 5; SEQ ID NO 828; 141pp; English. 
XX 

CC This invention describes a novel method of predicting or determining 

CC whether a mammal will respond or is responding to an anti-cancer agent

CC that modulates cyclin-dependent kinase (cdk) activity. The method 

CC comprises measuring the level of one or more biomarkers selected from 

CC 2774 biomarkers given in the specification (nucleotide sequence SEQ ID 

CC NO:1246 (Genbank EST W28729) is especially preferred). The method of the

CC invention is utilized in a kit for determining or predicting whether 

CC patient would be susceptible or resistant to treatment by an agent 

CC modulating cdk activity. The invention also describes a method for 

CC utilizing individualized genetic profiles for treating diseases and 

CC disorders based on patient's response and molecular level, specialized 

CC microarrays comprising the biomarkers described, antibodies directed 

CC against the biomarkers and a cell culture model to identify biomarkers. 

CC The cdk modulator is preferably N-5-[[5-(1,1-Dimethylethyl)-2-

CC oxazolyl]methyl]thio]-2-thiazolyl-4-piperidine carboxamide, 0.5-L-

CC tartaric acid salt. Note: The sequence data for this patent did not form

CC part of the printed specification, but was obtained in electronic format 

CC directly from WIPO at ftp.wipo.int/pub/published_pct_sequences. This 

CC sequence represents a biomarker used in the method of the invention. 

XX 

SQ Sequence 1297 AA;

Query Match 100.0%; Score 6650; DB 9; Length 1297; 
Best Local Similarity 100.0%; Pred. No. 0; 

Matches 1297; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy          1 MIKRLKMVVKTFMDMDQDSEDEKQQYLPLALHLASEFFLRNPNKDVRLLVACCLADIFRI 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 MIKRLKMVVKTFMDMDQDSEDEKQQYLPLALHLASEFFLRNPNKDVRLLVACCLADIFRI 60

Qy         61 YAPEAPYTSHDKLKDIFLFITRQLKGLEDTKSPQFNRYFYLLENLAWVKSYNICFELEDC 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 YAPEAPYTSHDKLKDIFLFITRQLKGLEDTKSPQFNRYFYLLENLAWVKSYNICFELEDC 120

Qy        121 NEIFIQLFRTLFSVINNSHNKKVQMHMLDLMSSIIMEGDGVTQELLGSILINLIPAHKNL 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 NEIFIQLFRTLFSVINNSHNKKVQMHMLDLMSSIIMEGDGVTQELLGSILINLIPAHKNL 180

Qy        181 NKQSFDLAKVLLKRTVQTIEACIANFFNQVLVLGRSSVSDLSEHVFDLIQELFAIDPHLL 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 NKQSFDLAKVLLKRTVQTIEACIANFFNQVLVLGRSSVSDLSEHVFDLIQELFAIDPHLL 240

Qy        241 LSVMPQLEFKLKSNDGEERLAVVRLLAKLFGSKDSDLATQNRPLWQCFLGRFNDIHVPVR 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        241 LSVMPQLEFKLKSNDGEERLAVVRLLAKLFGSKDSDLATQNRPLWQCFLGRFNDIHVPVR 300

Qy        301 LESVKFASHCLMNHPDLAKDLTEYLKVRSHDPEEAIRHDVIVTIITAAKRDLALVNDQLL 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 LESVKFASHCLMNHPDLAKDLTEYLKVRSHDPEEAIRHDVIVTIITAAKRDLALVNDQLL 360

Qy        361 GFVRERTLDKRWRVRKEAMMGLAQLYKKYCLHGEAGKEAAEKVSWIKDKLLHIYYQNSID 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        361 GFVRERTLDKRWRVRKEAMMGLAQLYKKYCLHGEAGKEAAEKVSWIKDKLLHIYYQNSID 420

Qy        421 DKLLVEKIFAQYLVPHNLETEERMKCLYYLYASLDPNAVKALNEMWKCQNMLRSHVRELL 480
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        421 DKLLVEKIFAQYLVPHNLETEERMKCLYYLYASLDPNAVKALNEMWKCQNMLRSHVRELL 480

Qy        481 DLHKQPTSEANCSAMFGKLMTIAKNLPDPGKAQDFVKKFNQVLGDDEKLRSQLELLISPT 540
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        481 DLHKQPTSEANCSAMFGKLMTIAKNLPDPGKAQDFVKKFNQVLGDDEKLRSQLELLISPT 540

Qy        541 CSCKQADICVREIARKLANPKQPTNPFLEMVKFLLERIAPVHIDSEAISALVKLMNKSIE 600
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        541 CSCKQADICVREIARKLANPKQPTNPFLEMVKFLLERIAPVHIDSEAISALVKLMNKSIE 600

Qy        601 GTADDEEEGVSPDTAIRSGLELLKVLSFTHPTSFHSAETYESLLQCLRMEDDKVAEAAIQ 660
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        601 GTADDEEEGVSPDTAIRSGLELLKVLSFTHPTSFHSAETYESLLQCLRMEDDKVAEAAIQ 660

Qy        661 IFRNTGHKIETDLPQIRSTLIPILHQKAKRGTPHQAKQAVHCIHAIFTNKEVQLAQIFEP 720
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        661 IFRNTGHKIETDLPQIRSTLIPILHQKAKRGTPHQAKQAVHCIHAIFTNKEVQLAQIFEP 720

Qy        721 LSRSLNADVPEQLITPLVSLGHISMLAPDQFASPMKSVVANFIVKDLLMNDRSTGEKNGK 780
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        721 LSRSLNADVPEQLITPLVSLGHISMLAPDQFASPMKSVVANFIVKDLLMNDRSTGEKNGK 780

Qy        781 LWSPDEEVSPEVLAKVQAIKLLVRWLLGMKNNQSKSANSTLRLLSAMLVSEGDLTEQKRI 840
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        781 LWSPDEEVSPEVLAKVQAIKLLVRWLLGMKNNQSKSANSTLRLLSAMLVSEGDLTEQKRI 840

Qy        841 SKSDMSRLRLAAGSAIMKLAQEPCYHEIITPEQFQLCALVINDECYQVRQIFAQKLHKAL 900
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        841 SKSDMSRLRLAAGSAIMKLAQEPCYHEIITPEQFQLCALVINDECYQVRQIFAQKLHKAL 900

Qy        901 VKLLLPLEYMAIFALCAKDPVKERRAHARQCLLKNISIRREYIKQNPMATEKLLSLLPEY 960
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        901 VKLLLPLEYMAIFALCAKDPVKERRAHARQCLLKNISIRREYIKQNPMATEKLLSLLPEY 960

Qy        961 VVPYMIHLLAHDPDFTRSQDVDQLRDIKECLWFMLEVLMTKNENNSHAFMKKMAENIKLT 1020
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        961 VVPYMIHLLAHDPDFTRSQDVDQLRDIKECLWFMLEVLMTKNENNSHAFMKKMAENIKLT 1020

Qy       1021 RDAQSPDESKTNEKLYTVCDVALCVINSKSALCNADSPKDPVLPMKFFTQPEKDFCNDKS 1080
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1021 RDAQSPDESKTNEKLYTVCDVALCVINSKSALCNADSPKDPVLPMKFFTQPEKDFCNDKS 1080

Qy       1081 YISEETRVLLLTGKPKPAGVLGAVNKPLSATGRKPYVRSTGTETGSNINVNSELNPSTGN 1140
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1081 YISEETRVLLLTGKPKPAGVLGAVNKPLSATGRKPYVRSTGTETGSNINVNSELNPSTGN 1140

Qy       1141 RSREQSSEAAETGVSENEENPVRIISVTPVKNIDPVKNKEINSDQATQGNISSDRGKKRT 1200
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1141 RSREQSSEAAETGVSENEENPVRIISVTPVKNIDPVKNKEINSDQATQGNISSDRGKKRT 1200

Qy       1201 VTAAGAENIQQKTDEKVDESGPPAPSKPRRGRRPKSESQGNATKNDDLNKPINKGRKRAA 1260
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1201 VTAAGAENIQQKTDEKVDESGPPAPSKPRRGRRPKSESQGNATKNDDLNKPINKGRKRAA 1260

Qy       1261 VGQESPGGLEAGNAKAPKLQDLAKKAAPAERQIDLQR 1297
              |||||||||||||||||||||||||||||||||||||
Db       1261 VGQESPGGLEAGNAKAPKLQDLAKKAAPAERQIDLQR 1297
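
The "Query Match" percentages printed in the alignment headers of this report are consistent with a simple ratio of each hit's score to the query's perfect (self-match) score. The following is a minimal Python sketch under that assumption; the function name is illustrative and not part of GenCore, and the perfect score 6744 used for the nucleic query is taken from the earlier nucleic search header in this file.

```python
def query_match_percent(score: float, perfect_score: float) -> float:
    """Score as a percentage of the query's perfect score, rounded to 0.1.

    Assumption: this mirrors how the report's Query Match column relates to
    its Score column; it is inferred from the printed values, not from
    GenCore documentation.
    """
    return round(100.0 * score / perfect_score, 1)


# (score, perfect score, percentage printed in the report)
reported = [
    (6650.0, 6650.0, 100.0),  # protein query vs. ADX06263
    (6643.0, 6650.0, 99.9),   # protein query vs. ABR58297
    (5070.6, 6744.0, 75.2),   # nucleic query vs. CR749425
]

for score, perfect, printed in reported:
    assert query_match_percent(score, perfect) == printed
```

The same ratio reproduces the percentages in the summary tables, for example 4711 / 6650 for result 4 of the protein search.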



RESULT 2 
ABR58297 

ID ABR58297 standard; protein; 1337 AA.
XX

AC ABR58297;
XX 

DT 08-JUL-2003 (first entry) 
XX 

DE BCU0720 protein #SEQ ID 40.
XX 

KW Breast cancer; cytostatic; gene therapy; antisense therapy; regulated; 

KW drug discovery; clinical medicine; forensic medicine; chromosome 4p14.
XX 

OS Homo sapiens . 
XX 

PN WO2003029421-A2. 
XX 

PD 10-APR-2003.
XX

PF 02-OCT-2002; 2002WO-US031287.
XX

PR 03-OCT-2001; 2001US-0326526P.

PR 14-MAY-2002; 2002US-00144194.
XX 

PA (ORIG-) ORIGENE TECHNOLOGIES INC. 
XX 

PI Sun Z, Li X, Fan W, Kovacs KF, Jay G; 
XX 

DR WPI; 2003-381623/36. 

DR N-PSDB; ACC72031.
XX 

PT New isolated human differentially-regulated breast cancer polynucleotide 

PT and polypeptide, useful for diagnosing, staging, prognosticating, 

PT preventing and/or treating diseases and conditions relating to breast 

PT cancer. 

XX 

PS Claim 6; SEQ ID NO 40; 127pp + Sequence Listing; English. 
XX 

CC The invention relates to isolated polynucleotides which are 

CC differentially-regulated in breast cancer. The methods and compositions 

CC of the present invention are useful for detecting, diagnosing, staging, 

CC monitoring, prognosticating, preventing and/or treating diseases and 

CC conditions relating to breast cancer, and may be used in gene therapy or 
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CC antisense therapy. They can also be used in research, drug discovery, 

CC clinical medicine and forensic medicine. Sequences given in records

CC ABR58278-ABR58323 and ABR58346-ABR58362 represent polypeptides encoded by

CC polynucleotides of the invention that are differentially-regulated in

CC breast cancer. NOTE: The sequence data for this patent did not form part 

CC of the printed specification, but was obtained in electronic format 

CC directly from WIPO at ftp.wipo.int/pub/published_pct_sequences 

XX 

SQ Sequence 1337 AA; 

Query Match 99.9%; Score 6643; DB 6; Length 1337; 

Best Local Similarity 99.9%; Pred. No. 0;

Matches 1296; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy          1 MIKRLKMVVKTFMDMDQDSEDEKQQYLPLALHLASEFFLRNPNKDVRLLVACCLADIFRI 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         41 MIKRLKMVVKTFMDMDQDSEDEKQQYLPLALHLASEFFLRNPNKDVRLLVACCLADIFRI 100

Qy         61 YAPEAPYTSHDKLKDIFLFITRQLKGLEDTKSPQFNRYFYLLENLAWVKSYNICFELEDC 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        101 YAPEAPYTSHDKLKDIFLFITRQLKGLEDTKSPQFNRYFYLLENLAWVKSYNICFELEDC 160

Qy        121 NEIFIQLFRTLFSVINNSHNKKVQMHMLDLMSSIIMEGDGVTQELLGSILINLIPAHKNL 180
              |||||||||||||||||||||||||||||||||||||||||||||| |||||||||||||
Db        161 NEIFIQLFRTLFSVINNSHNKKVQMHMLDLMSSIIMEGDGVTQELLDSILINLIPAHKNL 220

Qy        181 NKQSFDLAKVLLKRTVQTIEACIANFFNQVLVLGRSSVSDLSEHVFDLIQELFAIDPHLL 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        221 NKQSFDLAKVLLKRTVQTIEACIANFFNQVLVLGRSSVSDLSEHVFDLIQELFAIDPHLL 280

Qy        241 LSVMPQLEFKLKSNDGEERLAVVRLLAKLFGSKDSDLATQNRPLWQCFLGRFNDIHVPVR 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        281 LSVMPQLEFKLKSNDGEERLAVVRLLAKLFGSKDSDLATQNRPLWQCFLGRFNDIHVPVR 340

Qy        301 LESVKFASHCLMNHPDLAKDLTEYLKVRSHDPEEAIRHDVIVTIITAAKRDLALVNDQLL 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        341 LESVKFASHCLMNHPDLAKDLTEYLKVRSHDPEEAIRHDVIVTIITAAKRDLALVNDQLL 400

Qy        361 GFVRERTLDKRWRVRKEAMMGLAQLYKKYCLHGEAGKEAAEKVSWIKDKLLHIYYQNSID 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        401 GFVRERTLDKRWRVRKEAMMGLAQLYKKYCLHGEAGKEAAEKVSWIKDKLLHIYYQNSID 460

Qy        421 DKLLVEKIFAQYLVPHNLETEERMKCLYYLYASLDPNAVKALNEMWKCQNMLRSHVRELL 480
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        461 DKLLVEKIFAQYLVPHNLETEERMKCLYYLYASLDPNAVKALNEMWKCQNMLRSHVRELL 520

Qy        481 DLHKQPTSEANCSAMFGKLMTIAKNLPDPGKAQDFVKKFNQVLGDDEKLRSQLELLISPT 540
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        521 DLHKQPTSEANCSAMFGKLMTIAKNLPDPGKAQDFVKKFNQVLGDDEKLRSQLELLISPT 580

Qy        541 CSCKQADICVREIARKLANPKQPTNPFLEMVKFLLERIAPVHIDSEAISALVKLMNKSIE 600
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        581 CSCKQADICVREIARKLANPKQPTNPFLEMVKFLLERIAPVHIDSEAISALVKLMNKSIE 640

Qy        601 GTADDEEEGVSPDTAIRSGLELLKVLSFTHPTSFHSAETYESLLQCLRMEDDKVAEAAIQ 660
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        641 GTADDEEEGVSPDTAIRSGLELLKVLSFTHPTSFHSAETYESLLQCLRMEDDKVAEAAIQ 700

Qy        661 IFRNTGHKIETDLPQIRSTLIPILHQKAKRGTPHQAKQAVHCIHAIFTNKEVQLAQIFEP 720
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        701 IFRNTGHKIETDLPQIRSTLIPILHQKAKRGTPHQAKQAVHCIHAIFTNKEVQLAQIFEP 760

Qy        721 LSRSLNADVPEQLITPLVSLGHISMLAPDQFASPMKSVVANFIVKDLLMNDRSTGEKNGK 780
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        761 LSRSLNADVPEQLITPLVSLGHISMLAPDQFASPMKSVVANFIVKDLLMNDRSTGEKNGK 820

Qy        781 LWSPDEEVSPEVLAKVQAIKLLVRWLLGMKNNQSKSANSTLRLLSAMLVSEGDLTEQKRI 840
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        821 LWSPDEEVSPEVLAKVQAIKLLVRWLLGMKNNQSKSANSTLRLLSAMLVSEGDLTEQKRI 880

Qy        841 SKSDMSRLRLAAGSAIMKLAQEPCYHEIITPEQFQLCALVINDECYQVRQIFAQKLHKAL 900
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        881 SKSDMSRLRLAAGSAIMKLAQEPCYHEIITPEQFQLCALVINDECYQVRQIFAQKLHKAL 940

Qy        901 VKLLLPLEYMAIFALCAKDPVKERRAHARQCLLKNISIRREYIKQNPMATEKLLSLLPEY 960
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        941 VKLLLPLEYMAIFALCAKDPVKERRAHARQCLLKNISIRREYIKQNPMATEKLLSLLPEY 1000

Qy        961 VVPYMIHLLAHDPDFTRSQDVDQLRDIKECLWFMLEVLMTKNENNSHAFMKKMAENIKLT 1020
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1001 VVPYMIHLLAHDPDFTRSQDVDQLRDIKECLWFMLEVLMTKNENNSHAFMKKMAENIKLT 1060

Qy       1021 RDAQSPDESKTNEKLYTVCDVALCVINSKSALCNADSPKDPVLPMKFFTQPEKDFCNDKS 1080
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1061 RDAQSPDESKTNEKLYTVCDVALCVINSKSALCNADSPKDPVLPMKFFTQPEKDFCNDKS 1120

Qy       1081 YISEETRVLLLTGKPKPAGVLGAVNKPLSATGRKPYVRSTGTETGSNINVNSELNPSTGN 1140
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1121 YISEETRVLLLTGKPKPAGVLGAVNKPLSATGRKPYVRSTGTETGSNINVNSELNPSTGN 1180

Qy       1141 RSREQSSEAAETGVSENEENPVRIISVTPVKNIDPVKNKEINSDQATQGNISSDRGKKRT 1200
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1181 RSREQSSEAAETGVSENEENPVRIISVTPVKNIDPVKNKEINSDQATQGNISSDRGKKRT 1240

Qy       1201 VTAAGAENIQQKTDEKVDESGPPAPSKPRRGRRPKSESQGNATKNDDLNKPINKGRKRAA 1260
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1241 VTAAGAENIQQKTDEKVDESGPPAPSKPRRGRRPKSESQGNATKNDDLNKPINKGRKRAA 1300

Qy       1261 VGQESPGGLEAGNAKAPKLQDLAKKAAPAERQIDLQR 1297
              |||||||||||||||||||||||||||||||||||||
Db       1301 VGQESPGGLEAGNAKAPKLQDLAKKAAPAERQIDLQR 1337





RESULT 3 
ABG72803 

ID ABG72803 standard; protein; 1297 AA.
XX

AC ABG72803;
XX 

DT 24-FEB-2003 (first entry) 
XX 

DE Human tumour supressor SCC-112.
XX 

KW Human; SCC-112; cancer; apoptosis; allodynia; cell proliferation; 

KW degenerative disorder; metastasis inhibition; breast cancer; causalgia; 

KW kidney cancer; bladder cancer; pancreatic cancer; colon cancer; 

KW squamous cell carcinoma; head trauma; spinal cord injury; herpes zoster; 

KW global and focal ischaemic and haemorrhagic stroke; epilepsy; neuralgia; 

KW hypoxia-induced nerve cell damage; anxiety; diabetes mellitus;

KW cardiac arrest; spinal cord lesion; lung cancer; tumour supressor; 

KW neonatal distress; Alzheimer's disease; colon cancer; stomach cancer; 

KW multiple sclerosis; phantom limb pain; hyperalgesia; Down's syndrome;

KW Huntington's disease; Parkinson's disease; Korsakoff's syndrome;

KW amyotrophic lateral sclerosis; cell survival; uterine cancer. 
XX 

OS Homo sapiens.
XX 

FH Key     Location/Qualifiers
FT Domain  2..137
FT         /label= RhoGEF domain
FT Domain  127..651
FT         /label= N-adaptin_domain
FT Region  166..187
FT         /label= Leucine_Zipper
FT Domain  249..350
FT         /label= SKP1_domain
FT Region  597..617
FT         /note= "PEST sequence site"
FT Region  858..865
FT         /note= "Tyrosine kinase phosphorylation site"
FT Region  920..926
FT         /note= "Nuclear localisation signature sequence"
FT Region  1030..1036
FT         /note= "Tyrosine kinase phosphorylation site"
FT Region  1143..1663
FT         /note= "PEST sequence site"
FT Region  1216..1277
FT         /note= "PEST sequence site"
FT Region  1225..1231
FT         /note= "Nuclear localisation signature sequence"
FT Region  1227..1230
FT         /note= "Nuclear localisation signature sequence"
FT Region  1228..1234
FT         /note= "Nuclear localisation signature sequence"
FT Region  1232..1235
FT         /note= "Nuclear localisation signature sequence"
FT Region  1251..1257
FT         /note= "Nuclear localisation signature sequence"
FT Region  1278..1297
FT         /label= Epitope_region
FT         /note= "Residues 1278 to 1297 are specifically claimed in
FT         claim 41"

XX 

PN WO200281641-A2.
XX
PD 17-OCT-2002.
XX
PF 08-APR-2002; 2002WO-US010850.
XX
PR 06-APR-2001; 2001US-0281780P.
XX
PA (GEOU ) UNIV GEORGETOWN.
XX
PI Kasid UN, Kumar D, Ahmad I;
XX
DR WPI; 2003-103330/09.
DR N-PSDB; ABX14056.
XX 

PT New isolated SSC (undefined) tumor suppressor polypeptides and 

PT polynucleotides, useful for diagnosing, preventing or treating cancer or 

PT degenerative disease, e.g. Alzheimer's Disease, Huntington's disease, or 

PT multiple sclerosis. 

XX 
PS Claim 13; Fig 1B; 83pp; English.
XX 

CC The invention relates to a new isolated polypeptide SCC-112. The SCC-112
CC polypeptides and polynucleotides are useful for diagnosing, preventing or
CC treating cancer (e.g. breast cancer, kidney cancer, bladder cancer,
CC pancreatic cancer, colon cancer, squamous cell carcinoma, uterine cancer,
CC stomach cancer, colon cancer, lung cancer); or degenerative disease or
CC disorder (e.g. global and focal ischaemic and haemorrhagic stroke, head
CC trauma, spinal cord injury, hypoxia-induced nerve cell damage, nerve cell
CC damage caused by cardiac arrest or neonatal distress, epilepsy, anxiety,
CC diabetes mellitus, multiple sclerosis, phantom limb pain, causalgia,
CC neuralgias, herpes zoster, spinal cord lesions, hyperalgesia, allodynia,
CC Alzheimer's disease, Huntington's disease, Parkinson's disease,
CC amyotrophic lateral sclerosis, Down's syndrome and Korsakoff's syndrome).
CC The polypeptides and polynucleotides are also useful for inducing
CC apoptosis in cancer cells, increasing survival or proliferation of a
CC cell, or inhibiting cancer cell proliferation and/or metastasis in a
CC cancer patient. The polynucleotides can be used as probes to detect
CC complementary nucleotide sequences, or as primers to obtain additional
CC copies of the polynucleotides. SSC-112 may also be used for identifying
CC drugs for treatment of cancers. The present sequence represents the amino
CC acid sequence of human tumour supressor SCC-112

XX 

SQ Sequence 1297 AA; 

Query Match 99.5%; Score 6618; DB 6; Length 1297; 

Best Local Similarity 99.7%; Pred. No. 0; 

Matches 1293; Conservative 0; Mismatches 4; Indels 0; Gaps 0; 



Qy      1 MIKRLKMVVKTFMDMDQDSEDEKQQYLPLALHLASEFFLRNPNKDVRLLVACCLADIFRI 60
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db      1 MIKRLKMVVKTFMDMDQDSEDEKQQYLPLALHLASEFFLRNPNKDVRLLVACCLADIFRI 60

Qy     61 YAPEAPYTSHDKLKDIFLFITRQLKGLEDTKSPQFNRYFYLLENLAWVKSYNICFELEDC 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     61 YAPEAPYTSHDKLKDIFLFITRQLKGLEDTKSPQFNRYFYLLENLAWVKSYNICFELEDC 120

Qy    121 NEIFIQLFRTLFSVINNSHNKKVQMHMLDLMSSIIMEGDGVTQELLGSILINLIPAHKNL 180
          |||||||||||||||||||||||||||||||||||||| |||||||||||||||||||||
Db    121 NEIFIQLFRTLFSVINNSHNKKVQMHMLDLMSSIIMEGCGVTQELLGSILINLIPAHKNL 180

Qy    181 NKQSFDLAKVLLKRTVQTIEACIANFFNQVLVLGRSSVSDLSEHVFDLIQELFAIDPHLL 240
          ||||||||||||||||||||||||||||||||||||||||||| ||||||||||||||||
Db    181 NKQSFDLAKVLLKRTVQTIEACIANFFNQVLVLGRSSVSDLSEGVFDLIQELFAIDPHLL 240

Qy    241 LSVMPQLEFKLKSNDGEERLAVVRLLAKLFGSKDSDLATQNRPLWQCFLGRFNDIHVPVR 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    241 LSVMPQLEFKLKSNDGEERLAVVRLLAKLFGSKDSDLATQNRPLWQCFLGRFNDIHVPVR 300

Qy    301 LESVKFASHCLMNHPDLAKDLTEYLKVRSHDPEEAIRHDVIVTIITAAKRDLALVNDQLL 360
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    301 LESVKFASHCLMNHPDLAKDLTEYLKVRSHDPEEAIRHDVIVTIITAAKRDLALVNDQLL 360

Qy    361 GFVRERTLDKRWRVRKEAMMGLAQLYKKYCLHGEAGKEAAEKVSWIKDKLLHIYYQNSID 420
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    361 GFVRERTLDKRWRVRKEAMMGLAQLYKKYCLHGEAGKEAAEKVSWIKDKLLHIYYQNSID 420

Qy    421 DKLLVEKIFAQYLVPHNLETEERMKCLYYLYASLDPNAVKALNEMWKCQNMLRSHVRELL 480
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    421 DKLLVEKIFAQYLVPHNLETEERMKCLYYLYASLDPNAVKALNEMWKCQNMLRSHVRELL 480

Qy    481 DLHKQPTSEANCSAMFGKLMTIAKNLPDPGKAQDFVKKFNQVLGDDEKLRSQLELLISPT 540
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    481 DLHKQPTSEANCSAMFGKLMTIAKNLPDPGKAQDFVKKFNQVLGDDEKLRSQLELLISPT 540

Qy    541 CSCKQADICVREIARKLANPKQPTNPFLEMVKFLLERIAPVHIDSEAISALVKLMNKSIE 600
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    541 CSCKQADICVREIARKLANPKQPTNPFLEMVKFLLERIAPVHIDSEAISALVKLMNKSIE 600

Qy    601 GTADDEEEGVSPDTAIRSGLELLKVLSFTHPTSFHSAETYESLLQCLRMEDDKVAEAAIQ 660
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    601 GTADDEEEGVSPDTAIRSGLELLKVLSFTHPTSFHSAETYESLLQCLRMEDDKVAEAAIQ 660

Qy    661 IFRNTGHKIETDLPQIRSTLIPILHQKAKRGTPHQAKQAVHCIHAIFTNKEVQLAQIFEP 720
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    661 IFRNTGHKIETDLPQIRSTLIPILHQKAKRGTPHQAKQAVHCIHAIFTNKEVQLAQIFEP 720

Qy    721 LSRSLNADVPEQLITPLVSLGHISMLAPDQFASPMKSVVANFIVKDLLMNDRSTGEKNGK 780
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    721 LSRSLNADVPEQLITPLVSLGHISMLAPDQFASPMKSVVANFIVKDLLMNDRSTGEKNGK 780

Qy    781 LWSPDEEVSPEVLAKVQAIKLLVRWLLGMKNNQSKSANSTLRLLSAMLVSEGDLTEQKRI 840
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    781 LWSPDEEVSPEVLAKVQAIKLLVRWLLGMKNNQSKSANSTLRLLSAMLVSEGDLTEQKRI 840

Qy    841 SKSDMSRLRLAAGSAIMKLAQEPCYHEIITPEQFQLCALVINDECYQVRQIFAQKLHKAL 900
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    841 SKSDMSRLRLAAGSAIMKLAQEPCYHEIITPEQFQLCALVINDECYQVRQIFAQKLHKAL 900

Qy    901 VKLLLPLEYMAIFALCAKDPVKERRAHARQCLLKNISIRREYIKQNPMATEKLLSLLPEY 960
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    901 VKLLLPLEYMAIFALCAKDPVKERRAHARQCLLKNISIRREYIKQNPMATEKLLSLLPEY 960

Qy    961 VVPYMIHLLAHDPDFTRSQDVDQLRDIKECLWFMLEVLMTKNENNSHAFMKKMAENIKLT 1020
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    961 VVPYMIHLLAHDPDFTRSQDVDQLRDIKECLWFMLEVLMTKNENNSHAFMKKMAENIKLT 1020

Qy   1021 RDAQSPDESKTNEKLYTVCDVALCVINSKSALCNADSPKDPVLPMKFFTQPEKDFCNDKS 1080
          ||||||||||||||||||||||||||||||||||||||||||||||||||||  ||||||
Db   1021 RDAQSPDESKTNEKLYTVCDVALCVINSKSALCNADSPKDPVLPMKFFTQPEDKFCNDKS 1080

Qy   1081 YISEETRVLLLTGKPKPAGVLGAVNKPLSATGRKPYVRSTGTETGSNINVNSELNPSTGN 1140
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1081 YISEETRVLLLTGKPKPAGVLGAVNKPLSATGRKPYVRSTGTETGSNINVNSELNPSTGN 1140

Qy   1141 RSREQSSEAAETGVSENEENPVRIISVTPVKNIDPVKNKEINSDQATQGNISSDRGKKRT 1200
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1141 RSREQSSEAAETGVSENEENPVRIISVTPVKNIDPVKNKEINSDQATQGNISSDRGKKRT 1200

Qy   1201 VTAAGAENIQQKTDEKVDESGPPAPSKPRRGRRPKSESQGNATKNDDLNKPINKGRKRAA 1260
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1201 VTAAGAENIQQKTDEKVDESGPPAPSKPRRGRRPKSESQGNATKNDDLNKPINKGRKRAA 1260

Qy   1261 VGQESPGGLEAGNAKAPKLQDLAKKAAPAERQIDLQR 1297
          |||||||||||||||||||||||||||||||||||||
Db   1261 VGQESPGGLEAGNAKAPKLQDLAKKAAPAERQIDLQR 1297



RESULT 4 

ABP64685

ID ABP64685 standard; protein; 919 AA.
XX

AC ABP64685;
XX 

DT 25-FEB-2003 (first entry) 
XX 

DE Human protein SEQ ID 345. 
XX 

KW Human; expressed sequence tag; EST; haematopoietic disorder; 

KW central nervous system disease; viral infection; 

KW peripheral nervous system disease; non-healing wound; infectious disease; 

KW immune deficiency; immune disorder; bacterial infection; allergy; cancer; 

KW fungal infection; autoimmune disorder; coagulation disorder; nootropic; 

KW antiallergic; antiinflammatory; immunosuppressive; neuroprotective; 

KW cytostatic; haemostatic; virucide; antibacterial; fungicide; 

KW immunostimulant; cerebroprotective.

XX 

OS Homo sapiens . 
XX 

PN WO200259260-A2.
XX

PD 01-AUG-2002.
XX

PF 16-NOV-2001; 2001WO-US042950.
XX

PR 17-NOV-2000; 2000US-00714936.
XX 

PA (HYSE-) HYSEQ INC. 
XX 

PI Tang YT, Goodrich RW, Liu C, Zhou P, Asundi V, Zhang J, Zhao QA; 

PI Ren F, Xue AJ, Yang Y, Wehrman T, Drmanac RT; 

XX 

DR WPI; 2002-590824/63. 

DR N-PSDB; ABQ99271. 
XX 

PT New isolated polynucleotide, useful in research, diagnostic or 

PT therapeutic methods, e.g. preventing or treating disorders involving 

PT aberrant protein expression or biological activity. 

XX

PS Claim 20; SEQ ID NO 345; 394pp; English.
XX 

CC The present invention relates to novel human coding sequences (ABQ99268- 

CC ABQ99608) and proteins (ABP64682-ABP65022). The sequences are useful in

CC therapeutic, diagnostic and research methods. The polynucleotides may be 

CC used in the field of molecular biology as hybridisation probes, primers 

CC for PCR, for chromosome and gene mapping, for the recombinant production 

CC of protein, or in generation of anti-sense DNA or RNA. The 

CC polynucleotides are useful in diagnostics as expressed sequence tags 

CC (ESTs) for identifying expressed genes or for physical mapping of the 

CC human genome. The proteins may be used as molecular weight markers, or as 

CC nutritional sources or supplements. The proteins may be used to maintain 

CC and expand cell population in a totipotential or pluripotential state

CC useful for re-engineering damaged or diseased tissues, transplantation, 

CC manufacture of bio-pharmaceuticals or the development of bio-sensors. The 

CC polynucleotides and proteins are useful for preventing, treating or 

CC ameliorating disorders involving aberrant protein expression or 

CC biological activity, e.g. haematopoietic disorders, central/peripheral 

CC nervous system diseases, mechanical and traumatic disorders, non-healing 

CC wounds, immune deficiencies and disorders, infectious diseases caused by 

CC viral, bacterial or fungal infection, autoimmune disorders, allergic 

CC reactions and conditions, coagulation disorders, or cancer. The 

CC polynucleotide sequences of the invention were assembled from ESTs 

CC isolated mainly by sequencing by hybridisation, and in some cases, 
CC sequences obtained from one or more public databases. Note: The sequence

CC data for this patent did not form part of the printed specification, but

CC was obtained in electronic format directly from WIPO at

CC ftp.wipo.int/pub/published_pct_sequences
XX 

SQ Sequence 919 AA; 



Query Match 70.8%; Score 4711; DB 5; Length 919;

Best Local Similarity 100.0%; Pred. No. 0;

Matches 919; Conservative 0; Mismatches 0; Indels 0; Gaps 0;


Qy    379 MMGLAQLYKKYCLHGEAGKEAAEKVSWIKDKLLHIYYQNSIDDKLLVEKIFAQYLVPHNL 438
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db      1 MMGLAQLYKKYCLHGEAGKEAAEKVSWIKDKLLHIYYQNSIDDKLLVEKIFAQYLVPHNL 60

Qy    439 ETEERMKCLYYLYASLDPNAVKALNEMWKCQNMLRSHVRELLDLHKQPTSEANCSAMFGK 498
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     61 ETEERMKCLYYLYASLDPNAVKALNEMWKCQNMLRSHVRELLDLHKQPTSEANCSAMFGK 120

Qy    499 LMTIAKNLPDPGKAQDFVKKFNQVLGDDEKLRSQLELLISPTCSCKQADICVREIARKLA 558
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    121 LMTIAKNLPDPGKAQDFVKKFNQVLGDDEKLRSQLELLISPTCSCKQADICVREIARKLA 180

Qy    559 NPKQPTNPFLEMVKFLLERIAPVHIDSEAISALVKLMNKSIEGTADDEEEGVSPDTAIRS 618
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    181 NPKQPTNPFLEMVKFLLERIAPVHIDSEAISALVKLMNKSIEGTADDEEEGVSPDTAIRS 240

Qy    619 GLELLKVLSFTHPTSFHSAETYESLLQCLRMEDDKVAEAAIQIFRNTGHKIETDLPQIRS 678
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    241 GLELLKVLSFTHPTSFHSAETYESLLQCLRMEDDKVAEAAIQIFRNTGHKIETDLPQIRS 300

Qy    679 TLIPILHQKAKRGTPHQAKQAVHCIHAIFTNKEVQLAQIFEPLSRSLNADVPEQLITPLV 738
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    301 TLIPILHQKAKRGTPHQAKQAVHCIHAIFTNKEVQLAQIFEPLSRSLNADVPEQLITPLV 360

Qy    739 SLGHISMLAPDQFASPMKSVVANFIVKDLLMNDRSTGEKNGKLWSPDEEVSPEVLAKVQA 798
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    361 SLGHISMLAPDQFASPMKSVVANFIVKDLLMNDRSTGEKNGKLWSPDEEVSPEVLAKVQA 420

Qy    799 IKLLVRWLLGMKNNQSKSANSTLRLLSAMLVSEGDLTEQKRISKSDMSRLRLAAGSAIMK 858
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    421 IKLLVRWLLGMKNNQSKSANSTLRLLSAMLVSEGDLTEQKRISKSDMSRLRLAAGSAIMK 480

Qy    859 LAQEPCYHEIITPEQFQLCALVINDECYQVRQIFAQKLHKALVKLLLPLEYMAIFALCAK 918
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    481 LAQEPCYHEIITPEQFQLCALVINDECYQVRQIFAQKLHKALVKLLLPLEYMAIFALCAK 540

Qy    919 DPVKERRAHARQCLLKNISIRREYIKQNPMATEKLLSLLPEYVVPYMIHLLAHDPDFTRS 978
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    541 DPVKERRAHARQCLLKNISIRREYIKQNPMATEKLLSLLPEYVVPYMIHLLAHDPDFTRS 600

Qy    979 QDVDQLRDIKECLWFMLEVLMTKNENNSHAFMKKMAENIKLTRDAQSPDESKTNEKLYTV 1038
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    601 QDVDQLRDIKECLWFMLEVLMTKNENNSHAFMKKMAENIKLTRDAQSPDESKTNEKLYTV 660

Qy   1039 CDVALCVINSKSALCNADSPKDPVLPMKFFTQPEKDFCNDKSYISEETRVLLLTGKPKPA 1098
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    661 CDVALCVINSKSALCNADSPKDPVLPMKFFTQPEKDFCNDKSYISEETRVLLLTGKPKPA 720

Qy   1099 GVLGAVNKPLSATGRKPYVRSTGTETGSNINVNSELNPSTGNRSREQSSEAAETGVSENE 1158
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SCORE Search Results Details for Application 10679580
and Search Result us-10-679-580a-2.rai.




This page gives you Search Results detail for the Application 10679580 and Search Result us-10-679-580a-2.rai.


GenCore version 5.1.9 
Copyright (c) 1993 - 2006 Biocceleration Ltd. 



OM protein - protein search, using sw model

Run on: July 21, 2006, 21:11:59 ; Search time 55 Seconds
(without alignments)
2064.131 Million cell updates/sec

Title: US-10-679-580A-2
Perfect score: 6650
Sequence: 1 MIKRLKMVVKTFMDMDQDSE KLQDLAKKAAPAERQIDLQR 1297

Scoring table: BLOSUM62
Gapop 10.0 , Gapext 0.5

Searched: 650591 seqs, 87530628 residues

Total number of hits satisfying chosen parameters: 650591

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
Maximum Match 100%
Listing first 45 summaries

Database : Issued_Patents_AA:*
1: /EMC_Celerra_SIDS3/ptodata/2/iaa/5_COMB.pep:*
2: /EMC_Celerra_SIDS3/ptodata/2/iaa/6_COMB.pep:*
3: /EMC_Celerra_SIDS3/ptodata/2/iaa/7_COMB.pep:*
4: /EMC_Celerra_SIDS3/ptodata/2/iaa/H_COMB.pep:*
5: /EMC_Celerra_SIDS3/ptodata/2/iaa/PCTUS_COMB.pep:*
6: /EMC_Celerra_SIDS3/ptodata/2/iaa/RE_COMB.pep:*
7: /EMC_Celerra_SIDS3/ptodata/2/iaa/backfiles1.pep:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
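
Note on the "% Query Match" column: the values listed in the summaries appear to be each hit's score expressed as a percentage of the perfect score (6650 for this query). This relationship is inferred from the reported numbers and is not stated by the report itself; the function name below is hypothetical. A minimal Python sketch of the check, under that assumption:

```python
def query_match_percent(score: float, perfect_score: float) -> float:
    """Return the score as a percentage of the perfect score (one decimal).

    Assumption: this mirrors how the '% Query Match' column is derived;
    the report does not state the formula explicitly.
    """
    return round(100.0 * score / perfect_score, 1)


if __name__ == "__main__":
    perfect = 6650  # 'Perfect score' taken from the search header
    # Scores of the first three hits in the summaries table
    for score in (4668.5, 4352.5, 702):
        print(score, query_match_percent(score, perfect))
```

The same calculation can be applied to any row of the table to spot digits that were damaged during text extraction.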



SUMMARIES

                %
Result        Query
  No.   Score Match Length DB  ID                    Description
    1  4668.5  70.2   1391  2  US-09-512-581D-2      Sequence 2, Appli
    2  4352.5  65.5   1346  2  US-09-949-016-9959    Sequence 9959, Ap
    3     702  10.6    312  2  US-09-270-767-45520   Sequence 45520, A
    4     377   5.7     92  2  US-09-513-999C-6892   Sequence 6892, Ap
    5     374   5.6    309  2  US-09-270-767-45536   Sequence 45536, A
    6   328.5   4.9    752  2  US-09-248-796A-15445  Sequence 15445, A
    7     314   4.7    101  2  US-09-471-276-1408    Sequence 1408, Ap
    8     284   4.3    235  2  US-09-270-767-43236   Sequence 43236, A
    9     219   3.3    265  2  US-09-248-796A-15446  Sequence 15446, A
   10     199   3.0   2663  2  US-09-538-092-1252    Sequence 1252, Ap
   11     198   3.0   2662  2  US-09-595-684B-31     Sequence 31, Appl
   12   195.5   2.9   3248  1  US-08-353-700-1       Sequence 1, Appli
   13   195.5   2.9   3248  5  PCT-US95-16216-1      Sequence 1, Appli
   14   191.5   2.9   2482  1  US-08-328-254-6       Sequence 6, Appli
   15   190.5   2.9   3210  2  US-09-538-092-1154    Sequence 1154, Ap
   16     189   2.8   1051  2  US-08-965-762-14      Sequence 14, Appl
   17     189   2.8   1051  2  US-09-911-927-14      Sequence 14, Appl
   18     189   2.8   1051  2  US-09-911-882-14      Sequence 14, Appl
   19     189   2.8   1051  2  US-09-911-888-14      Sequence 14, Appl
   20   181.5   2.7    169  2  US-09-270-767-33370   Sequence 33370, A
   21   181.5   2.7    169  2  US-09-270-767-48587   Sequence 48587, A
   22   181.5   2.7   2265  2  US-10-144-198-35      Sequence 35, Appl
   23     180   2.7   2733  2  US-09-949-016-11433   Sequence 11433, A
   24   177.5   2.7   3878  2  US-09-914-259-11      Sequence 11, Appl
   25     177   2.7   2158  2  US-10-144-198-34      Sequence 34, Appl
   26     177   2.7   2697  2  US-10-144-198-12      Sequence 12, Appl
   27   176.5   2.7   1979  2  US-09-949-016-6468    Sequence 6468, Ap
   28   176.5   2.7   2047  2  US-09-949-016-7404    Sequence 7404, Ap
   29     176   2.6   1055  2  US-09-949-016-9776    Sequence 9776, Ap
   30     175   2.6   3696  2  US-09-134-001C-5080   Sequence 5080, Ap
   31     174   2.6   3259  2  US-09-949-016-6507    Sequence 6507, Ap
   32     172   2.6    994  2  US-09-949-016-6779    Sequence 6779, Ap
   33     172   2.6   1173  2  US-09-248-796A-19313  Sequence 19313, A
   34     168   2.5   2704  2  US-09-538-092-1260    Sequence 1260, Ap
   35   167.5   2.5   2954  2  US-09-150-867-1       Sequence 1, Appli
   36   167.5   2.5   2954  3  US-09-724-584-1       Sequence 1, Appli
   37   164.5   2.5   2101  1  US-08-466-390-4       Sequence 4, Appli
   38   164.5   2.5   2101  1  US-08-470-950-4       Sequence 4, Appli
   39   164.5   2.5   2101  1  US-08-467-781-4       Sequence 4, Appli
   40   164.5   2.5   2101  1  US-08-195-487-4       Sequence 4, Appli
   41   164.5   2.5   2101  1  US-08-483-924-4       Sequence 4, Appli
   42   164.5   2.5   2101  2  US-09-452-294-1       Sequence 1, Appli
   43   164.5   2.5   2101  2  US-09-296-662-32      Sequence 32, Appl
   44   164.5   2.5   2101  5  PCT-US93-06160-4      Sequence 4, Appli
   45   164.5   2.5   2115  2  US-09-296-662-33      Sequence 33, Appl




ALIGNMENTS 



RESULT 1 

US-09-512-581D-2
; Sequence 2, Application US/09512581D
; Patent No. 6994992
; GENERAL INFORMATION:
; APPLICANT: Soto, Ana M.
; APPLICANT: Sonnenschein, Carlos
; APPLICANT: Geck, Peter
; APPLICANT: Szelei, Jozsef
; TITLE OF INVENTION: NOVEL ANDROGEN-INDUCED SUPPRESSOR OF CELL PROLIFERATION AND USES
; TITLE OF INVENTION: THEREOF
; FILE REFERENCE: 34724-026
; CURRENT APPLICATION NUMBER: US/09/512,581D
; CURRENT FILING DATE: 2000-02-24
; PRIOR APPLICATION NUMBER: US 60/121,461
; PRIOR FILING DATE: 1999-02-24
; NUMBER OF SEQ ID NOS: 72
; SOFTWARE: PatentIn version 3.3
; SEQ ID NO 2
; LENGTH: 1391
; TYPE: PRT
; ORGANISM: Homo sapiens
US-09-512-581D-2

Query Match 70.2%; Score 4668.5; DB 2; Length 1391;
Best Local Similarity 69.0%; Pred. No. 0;
Matches 908; Conservative 160; Mismatches 199; Indels 49; Gaps 11;

Qy 1 MIKRLKMWKTFMDMDQDSEDEKQQYLPLALHLASEFFLRNPNKDVRLLVACCLADIFRI 60 

I- II II II II II II I I II h lh || II II II h II h :| I II I I I I I I I I I II I I I 
Db 31 MVRRLKMWKTFMDMDQDSEEEKE LYLNLALHLASDFFLKHPGKDVRLLVACCLADIFRI 90 

Qy 61 YAPEAPYTSHDKLKDIFLFITRQL KGLEDT KS PQFNRY FYLLENLAWVKS YNICFELEDC 12 0 

I'M IMIMMIIIII IIMIIIIMIIIII IMIMM MINIM II I 
Db 91 YAPEAPYTS £>DKLKDIFMFITRQL KG LEDTKS PQFNRY FYLLENIAWVKSYNICFELEDS 15 0 

Qy 121 NEIFIQLFRTLFSVINNSHNKKVQMHMLDLMSSI IMEGDGVTQELLGSILINLI PAHKNL 180 

I I I I I I : II I I I I I I I 11 = 11 I I I •• I I I I I I I III Mill : : I : I h I I I I I I 
Db 151 NEIFTQLYRTLFSVINNGHNQKVHMHMVDLMSSIICEGDTVSQELLDTVLVNLVPAHKNL 210 


Qy 181 NKQSFDLAKVLLKRTVQTIEACIANFFNQVLVLGRSSVSDLSEHVFDLIQELFAIDPHLL 24 0 

Ill-Mil IMM | || | II II MM MMI II II MM ||: || Ml 
Db 211 NKQAYDLAKALLKRTAQAIEPYITTFFNQVLMLG KTSISDLSEHVFDLILELYNIDSHLL 27 0 

Qy 241 LSVMPQLEFKLKSNDGEERLAVVRLLAKLFGSKDSD-LATQNRPLWQCFLGRFNDIHVPVR 300 

I M M II M I M I I ! I 1 1 I Ml IMI IMI M IMI M II II M II II II II M M 

Db 271 LSVLPQLEFKLKSNDNEERLQWKLLAKMFGAKDSELASQNKPLWQCYLGRFNDIHVPIR 33 0 

Qy 3 01 LESVKFASHCLMNHPDLAKDLTEYLKVRSHDPEEAIRHDVIVTI ITAAKRDLALVNDQLL 360 

II I 1 I I I I I I I I I I I ! I I I ! I I I I I I I I I I I I I I I I I I I i I ^ I : I I I I = M I I I I II 

Db 331 LECVKFASHCLMNHPDLAKDLTEYLKVRSHDPEEAIRHDVIVSIVTAAKKDILLVNDHLL 390 

Qy 3 61 GFVRERTLD KRWRVRKEAMMGLAQLY KKYCLHGEAGKEAAEKVSWIKDKLLHIYYQNSID 42 0 

I II II II Ml II II Ml MM II | II M||:::: || MIM Ml II II 
Db 3 91 NFVRE RTLDKRWRVRKEAMMGLAQIYKKYALQSAAGKDAAKQIAWIKDKLLHIYYQNSID 45 0 

Qy 421 DKLLVEKIFAQYLVPHNLETEERMKCLYYLYASLDPNAVKALNEMWKCQNMLRSHVRELL 480 

Ml II MM II MM II II I III II II II MM I 1 1 I M I 1 1 1 1 I M M I I MM 

Db 4 51 DRLLVERIFAQYMVPHNLETTERMKCLYYLYATLDLNAVKALNEMWKCQNLLRHQVKDLL 510 

Qy 4 81 DLHKQPTSEANCSAMFGKLMTIAKNL PD PGKAQDFVKKFNQVLGDDEKLRSQLELLISPT 54 0 

II III M: |:| hi I M I 1 1 1 1 Ml I III IMM 111=1 = 111 

Db 511 DLIKQPKTDASVKAI FSKVMVITRNL PD PGKAQDFMKKFTQVLEDDEKIRKQLEVLVSPT 570 

Qy 541 CSCKQADICVREIARKLANPKQPTNPFLEMVKFLLERIAPVHIDSEAI SALVKLMNKSIE 600 

MIMM Mill Ml I II III II M IMI II II II II II II M M II Ml MUM 

Db 571 CSCKQAEGCVREITKKLGNPKQPTNPFLEMIKFLLERIAPVHIDTESI SALIKQVNKSID 63 0 

Qy 6 01 GTADDEEEGVS PDTAIRSGLELLKVLSFTH PTSFHSAETYESLLQCLRMEDDKVAEAAIQ 660 

MIMM I I IMI II III II II II I IMIIMMM IMMMMIIMM 

Db 631 GTADDEDEGVPTDQAIRAGLELLKVLSFTHPISFHSAETFESLLACLKMDDEKVAEAALQ 690 

Qy 661 IFRNTGHKI ETDLPQIRSTLIPILHQKAKRGT PHQAKQAVHCIHAIFTNKEVQLAQIFEP 720 

IMM III I | Ml MMII I : I : I I Ml I : I I I I I I I = : I I I MINI 
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Db 6 91 I FKNTGS KI EEDF PH IRSALLPVLHHKSKKGPPRQAKYAI HC IHAI FS SKETQFAQ IFEP 750 

Qy 721 LSRSLNADVPEQLITPLVS LGHISMLAPDQFASPMKSVVANFIVKDLLMNDRSTGEKNGK 780 

I I I II I I h H Ih H II II Ihl II II IIIIIMIIII hi I 

Db 7 51 LHKSLDPSNLEHLITPLVTIGHIALLAPDQFAAPWKSWVATFIVKDLLMNDRLPGKKTTK 810 

Qy 7 81 LWS PDEEVS PEVLAKVQ AI KLLVR WLLGMKNNQS KS ANSTLRLLSAMLVS EGDLTEQKRI 84 0 

II Mill : hi II h HI II II II M Ml lllllh I hllllll 

Db 811 LWVPDEEVSPETMVKIQAIKMMVRWLLGMKNNHSKS GTSTLRLLTTILHSDGDLTEQGKI 870 

Qy 841 SKSDMSRLRLAAGSAIMKLAQEPCYHEI ITPEQFQLCALVINDECYQVRQIFAQKLHKAL 900 

II II II II II II II h II II III II II M hlllll I II II II Ihl II II h | 

Db 8 71 SKPDMSRLRLAAGSAIVKLAQEPCYHEI ITLEQYQLCALAINDECYQVRQVFAQKLHKGL 93 0 

Qy 901 VKLLLPLEYMAIFALCAKDPVKERRAHARQCLLKNI SIRREYIKQNPMATEKLLSLLPEY 960 

I II I II 1 1 I II II II II III II II II Ihl Ihhl Ihl h hllMIIIII 

Db 931 SRLRLPLEYMAICALCAKDPVKERRAHARQCLVKNINVRREYLKQHAAVSEKLLSLLPEY 99 0 

Qy 961 VVPYMIHLLAHDPDFTRSQDVDQLRDIKECLWFMLEVLMTKNENNSHAFMKKMAENIKLT 1020 

I I I I Mllllllh : I hhhhl II II hlhl llllllllhhl I I I I I 
Db 991 VVPYTIHLLAHDPDYVKVQDIEQLKDVKECLWFVLEILMAKNENNSHAFIRKMVENIKQT 1050 

Qy 1021 RDAQS PDESKTNEKLYTVCDVALCVINSKSALCNADSPKDPVLPMKFFTQPEKDFCNDKS 1080 

hi Ihh I I I I I I I E I I I : =| Ml : : I I I I I I I I hlllhhl I 
Db 10 51 KDAQGPDDAKMNE KLYTVCDVAMNIIMSKSTTYSLESPKDPVLPARFFTQ PDKNFSNTKN 1110 

Qy 10 81 YISEETRVLLLTGKPKPAGVLGAVNKPLSATGRKPYWSTGTETGSNINVNSELNPSTGN 1140 

h | : I I I I I I I I I I [ 1 1 I = |:: :|: II | =| llh 

Db 1111 YLPPEMKSFFTPGKPKTTNVLGAVNKPLSSAGKQSQTKSSRMETVS - -NASSSSNPSSPG 1168 

Qy 1141 RSREQSSEAAETGVSENEENPVRI ISVTPVKNID PVKNK 1179 

I : ■ =::] lllh hill II : . 

Db 1169 RIKGR -LDS SEMDHSENED - - YTMSSPLPGKKSDKRDDSDLVRSELEKPRGRKKTPVTEQ 1225 

Qy 1180 E - - INSDQAT- QGN I S SDRG KKRT VT AAGAEN IQQKTD EK VDESGPP 1223 

I : I I I I I hi Ih :: II hi II I 

Db 1226 EEKLGMDDLTKLVQEQKPKGSQRSRKRGHTASESDE -QQW PEEKRLKEDILENEDEQNSP 1284 

Qy 1224 APS KPRRGRRPKS ESQGNATKNDD LNKP INKGRKR AAVGQESPGGLEAGNAK 1275 

I I hi || I I : I II h I :| ::|| 

Db 1285 - PKKGKRGRPPKPLG -GGT PKEE PTMKTS KKGS KKKSGPPAPEEEEEEERQ SGNTE 1338 
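
Note on the alignment statistics for Result 1: the reported Best Local Similarity (69.0%) is consistent with Matches divided by the total number of alignment columns (Matches + Conservative + Mismatches + Indels). This is an inferred relationship, not one documented in the report, and the helper names below are hypothetical. A minimal Python sketch that parses the statistics line and checks the figure, under that assumption:

```python
import re


def parse_counts(line: str) -> dict:
    """Parse 'Name value;' pairs from a 'Matches ...; Gaps ...;' statistics line."""
    return {name: int(value) for name, value in re.findall(r"([A-Za-z]+)\s+(\d+);", line)}


def best_local_similarity(counts: dict) -> float:
    """Matches as a percentage of all alignment columns (one decimal).

    Assumption: alignment columns = Matches + Conservative + Mismatches + Indels.
    """
    columns = (counts["Matches"] + counts["Conservative"]
               + counts["Mismatches"] + counts["Indels"])
    return round(100.0 * counts["Matches"] / columns, 1)


if __name__ == "__main__":
    line = "Matches 908; Conservative 160; Mismatches 199; Indels 49; Gaps 11;"
    counts = parse_counts(line)
    print(counts, best_local_similarity(counts))
```

Under the same assumption, the Result 3 counts (1293 matches, 4 mismatches) give 99.7%, which agrees with the value reported for that hit.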



RESULT 2 

US-09-949-016-9959
; Sequence 9959, Application US/09949016
; Patent No. 6812339
; GENERAL INFORMATION:
; APPLICANT: VENTER, J. Craig et al.
; TITLE OF INVENTION: POLYMORPHISMS IN KNOWN GENES ASSOCIATED
; TITLE OF INVENTION: WITH HUMAN DISEASE, METHODS OF DETECTION AND USES THEREOF
; FILE REFERENCE: CL001307
; CURRENT APPLICATION NUMBER: US/09/949,016
; CURRENT FILING DATE: 2000-04-14
; PRIOR APPLICATION NUMBER: 60/241,755
; PRIOR FILING DATE: 2000-10-20
; PRIOR APPLICATION NUMBER: 60/237,768
; PRIOR FILING DATE: 2000-10-03
; PRIOR APPLICATION NUMBER: 60/231,498
; PRIOR FILING DATE: 2000-09-08
; NUMBER OF SEQ ID NOS: 207012
This page gives you Search Results detail for the Application 10679580 and Search Result us-10-679-580a-2.rpr. 



GenCore version 5.1.9 
Copyright (c) 1993 - 2006 Biocceleration Ltd. 



OM protein - protein search, using sw model

Run on: July 21, 2006, 21:06:29 ; Search time 53 Seconds
(without alignments)
2354.588 Million cell updates/sec

Title: US-10-679-580A-2
Perfect score: 6650
Sequence: 1 MIKRLKMVVKTFMDMDQDSE KLQDLAKKAAPAERQIDLQR 1297

Scoring table: BLOSUM62
Gapop 10.0 , Gapext 0.5

Searched: 283416 seqs, 96216763 residues

Total number of hits satisfying chosen parameters: 283416

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
Maximum Match 100%
Listing first 45 summaries

Database : PIR_80:*
pir1:*
pir2:*
pir3:*
pir4:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



                   %
Result           Query
   No.    Score  Match  Length  DB  ID      Description
     1     4350   65.4     851   2  T00374  hypothetical prote
     2    661.5    9.9    1579   2  T23142  hypothetical prote
     3      652    9.8    1506   2  S52957  bimD protein - Eme
     4    621.5    9.3    1303   2  E96805  hypothetical prote
     5      538    8.1    1277   2  S54451  hypothetical prote
     6      257    3.9     390   2  T43647  hypothetical prote
     7    256.5    3.9     780   2  F96840  hypothetical prote
     8      250    3.8     990   2  H86293  protein T24D18.4 [
     9      220    3.3     852   2  T06310  hypothetical prote
    10    214.5    3.2    1875   2  S38173  myosin-like protei
    11    204.5    3.1    1790   2  S67593  transport protein
    12      203    3.1    1819   2  A71928  cag island protein
    13    200.5    3.0    2139   2  T18296  myosin heavy chain
    14      199    3.0    2663   1  S28261  centromere protein
    15    198.5    3.0    1927   2  G64585  cag pathogenicity
    16    196.5    3.0    1160   2  T13713  beta3 protein - fr
    17      192    2.9    1679   2  S48385  hypothetical prote
    18    191.5    2.9    1302   1  JC6009  surface-located me
    19    190.5    2.9    1916   2  S46157  RIF1 protein - yea
    20      190    2.9    2253   2  T30336  nuclear/mitotic ap
    21      189    2.8    1051   2  S59791  probable membrane
    22      188    2.8    1630   2  S64403  ESP1 protein - yea
    23    185.5    2.8    4385   2  T29042  hypothetical prote
    24      185    2.8    2297   2  AB2494  hypothetical prote
    25    183.5    2.8    1721   2  T21214  hypothetical prote
    26    182.5    2.7    2819   2  A90551  conserved hypothet
    27    180.5    2.7     932   2  S65214  probable alpha/gam
    28      180    2.7    3225   2  I52300  giantin - human
    29    179.5    2.7    2346   2  T13829  Tpr homolog - frui
    30      179    2.7    2116   2  A26655  myosin heavy chain
    31      178    2.7    1034   2  T17458  chromosome condens
    32    175.5    2.6    1642   2  T08880  NMDA receptor-bind
    33    175.5    2.6    2493   2  S45734  probable membrane
    34      174    2.6    2692   2  T23768  hypothetical prote
    35      174    2.6    3259   1  A56539  giantin - human
    36    173.5    2.6    1931   2  A59234  slow myosin heavy
    37      173    2.6     798   2  I50479  neurofilament medi
    38      173    2.6    1226   2  I51617  kinesin-like prote
    39    172.5    2.6    1837   2  T41023  probable nuclear p
    40      172    2.6     845   2  A45669  neurofilament trip
    41      172    2.6    1727   2  T50073  myosin-like coiled
    42      172    2.6    2481   2  D90011  FmtB protein [impo
    43    170.5    2.6    1447   2  F82909  hypothetical prote
    44    170.5    2.6    1864   2  F86378  protein F21J9.12 [
    45    170.5    2.6    2415   1  A33733  spectrin alpha cha



ALIGNMENTS 



RESULT 1 
T00374
hypothetical protein KIAA0648 - human (fragment)
C;Species: Homo sapiens (man)
C;Date: 01-Feb-1999 #sequence_revision 01-Feb-1999 #text_change 09-Jul-2004
C;Accession: T00374
R;Ishikawa, K.; Nagase, T.; Suyama, M.; Miyajima, N.; Tanaka, A.; Kotani, H.; Nomura, N.; Ohara, O
DNA Res. 5, 169-176, 1998
A;Title: Prediction of the coding sequences of unidentified human genes. X. The complete sequences
A;Reference number: Z14142; MUID:98403880; PMID:9734811
A;Accession: T00374
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: mRNA
A;Residues: 1-851
A;Cross-references: UNIPROT:Q9Y4D4; UNIPARC:UPI000006D34B; EMBL:AB014548; NID:g3327109; PIDN:BAA31-
A;Experimental source: brain
C;Genetics:
A;Note: KIAA0648

  Query Match             65.4%;  Score 4350;  DB 2;  Length 851;
  Best Local Similarity   100.0%;  Pred. No. 4.8e-234;
  Matches  851;  Conservative    0;  Mismatches    0;  Indels    0;  Gaps    0;

Qy    447 LYYLYASLDPNAVKALNEMWKCQNMLRSHVRELLDLHKQPTSEANCSAMFGKLMTIAKNL 506
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db      1 LYYLYASLDPNAVKALNEMWKCQNMLRSHVRELLDLHKQPTSEANCSAMFGKLMTIAKNL 60

Qy    507 PDPGKAQDFVKKFNQVLGDDEKLRSQLELLISPTCSCKQADICVREIARKLANPKQPTNP 566
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     61 PDPGKAQDFVKKFNQVLGDDEKLRSQLELLISPTCSCKQADICVREIARKLANPKQPTNP 120

Qy    567 FLEMVKFLLERIAPVHIDSEAISALVKLMNKSIEGTADDEEEGVSPDTAIRSGLELLKVL 626
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    121 FLEMVKFLLERIAPVHIDSEAISALVKLMNKSIEGTADDEEEGVSPDTAIRSGLELLKVL 180

Qy    627 SFTHPTSFHSAETYESLLQCLRMEDDKVAEAAIQIFRNTGHKIETDLPQIRSTLIPILHQ 686
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    181 SFTHPTSFHSAETYESLLQCLRMEDDKVAEAAIQIFRNTGHKIETDLPQIRSTLIPILHQ 240

Qy    687 KAKRGTPHQAKQAVHCIHAIFTNKEVQLAQIFEPLSRSLNADVPEQLITPLVSLGHISML 746
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    241 KAKRGTPHQAKQAVHCIHAIFTNKEVQLAQIFEPLSRSLNADVPEQLITPLVSLGHISML 300

Qy    747 APDQFASPMKSVVANFIVKDLLMNDRSTGEKNGKLWSPDEEVSPEVLAKVQAIKLLVRWL 806
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    301 APDQFASPMKSVVANFIVKDLLMNDRSTGEKNGKLWSPDEEVSPEVLAKVQAIKLLVRWL 360

Qy    807 LGMKNNQSKSANSTLRLLSAMLVSEGDLTEQKRISKSDMSRLRLAAGSAIMKLAQEPCYH 866
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    361 LGMKNNQSKSANSTLRLLSAMLVSEGDLTEQKRISKSDMSRLRLAAGSAIMKLAQEPCYH 420

Qy    867 EIITPEQFQLCALVINDECYQVRQIFAQKLHKALVKLLLPLEYMAIFALCAKDPVKERRA 926
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    421 EIITPEQFQLCALVINDECYQVRQIFAQKLHKALVKLLLPLEYMAIFALCAKDPVKERRA 480

Qy    927 HARQCLLKNISIRREYIKQNPMATEKLLSLLPEYVVPYMIHLLAHDPDFTRSQDVDQLRD 986
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    481 HARQCLLKNISIRREYIKQNPMATEKLLSLLPEYVVPYMIHLLAHDPDFTRSQDVDQLRD 540

Qy    987 IKECLWFMLEVLMTKNENNSHAFMKKMAENIKLTRDAQSPDESKTNEKLYTVCDVALCVI 1046
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    541 IKECLWFMLEVLMTKNENNSHAFMKKMAENIKLTRDAQSPDESKTNEKLYTVCDVALCVI 600

Qy   1047 NSKSALCNADSPKDPVLPMKFFTQPEKDFCNDKSYISEETRVLLLTGKPKPAGVLGAVNK 1106
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    601 NSKSALCNADSPKDPVLPMKFFTQPEKDFCNDKSYISEETRVLLLTGKPKPAGVLGAVNK 660

Qy   1107 PLSATGRKPYVRSTGTETGSNINVNSELNPSTGNRSREQSSEAAETGVSENEENPVRIIS 1166
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    661 PLSATGRKPYVRSTGTETGSNINVNSELNPSTGNRSREQSSEAAETGVSENEENPVRIIS 720

Qy   1167 VTPVKNIDPVKNKEINSDQATQGNISSDRGKKRTVTAAGAENIQQKTDEKVDESGPPAPS 1226
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    721 VTPVKNIDPVKNKEINSDQATQGNISSDRGKKRTVTAAGAENIQQKTDEKVDESGPPAPS 780

Qy   1227 KPRRGRRPKSESQGNATKNDDLNKPINKGRKRAAVGQESPGGLEAGNAKAPKLQDLAKKA 1286
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    781 KPRRGRRPKSESQGNATKNDDLNKPINKGRKRAAVGQESPGGLEAGNAKAPKLQDLAKKA 840

Qy   1287 APAERQIDLQR 1297
          |||||||||||
Db    841 APAERQIDLQR 851

RESULT 2 
T23142
hypothetical protein H38K22.1 - Caenorhabditis elegans
C;Species: Caenorhabditis elegans
C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 09-Jul-2004
C;Accession: T23142; T24139
R;Barlow, K.
submitted to the EMBL Data Library, June 1998
A;Reference number: Z19695
A;Accession: T23142
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: DNA
A;Residues: 1-1579
A;Cross-references: UNIPROT:Q9XTF6; UNIPARC:UPI0000078215; EMBL:AL024499; PIDN:CAA19710.1; GSPDB:G1
A;Experimental source: clone H38K22
R;Ainscough, R.
submitted to the EMBL Data Library, August 1995
A;Reference number: Z19843
A;Accession: T24139
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: DNA
A;Residues: 1-1579
A;Cross-references: UNIPARC:UPI0000078215; EMBL:Z50874; PIDN:CAA90774.1; GSPDB:GN00021; CESP:H38K2:
A;Experimental source: clone R10E4
C;Genetics:
A;Gene: CESP:H38K22.1
A;Map position: 3
A;Introns: 28/3; 99/3; 130/3; 239/3; 323/3; 363/2; 396/1; 444/3; 559/2; 651/3; 689/3; 793/1; 831/3

  Query Match             9.9%;  Score 661.5;  DB 2;  Length 1579;
  Best Local Similarity   20.6%;  Pred. No. 1.5e-28;
  Matches  324;  Conservative  288;  Mismatches  616;  Indels  345;  Gaps   54;

Qy 2 I KRLKMVVKTFMDMDQDSEDE KQQYLPLALHLASEFFLRNPNKDVRLLVACCLADIF 5 8 

I : I I : : I • : • I ' ' I I h I I I I I I hi- Mhl 

Db 24 IERLRKLFNCLHDCKTNNSEEVSS PNRFARLFQHLSQECFLDNSNTDFRILLSLCLANIL 83 

Qy 59 RIYAPEAPYTSHDKLKDIFLFITRQLKGLEDT - - KSPQFNRYFYLLENLAWVKSYNICFE 116 

Ih II I I ||:::::: I ::|| I 11 = 1 II hi : I I 

Db 84 RIFQPELPTPSVMDLKEVYIYLFRTMRGLGDVTQDSPKFKNYFSLVETME - - KII PPIIE 141 

Qy 117 LEDCNE IFIQLFRTLFSV INNSHN 14 0 

::| :: :| | : : :: :|: 

Db 142 MKDHDDKEATPVFRALIKDILAIPCGKGWNQNLKKEARLLKIQENDDDSMNDDEEDENAA 201 

Qy 141 KKVQMHM LD LM S S 1 1 ME GD GVTQE LL GS I L INLI PAHKNLNKQS FD LAKVLL KRTVQT I E 200 

:|:: :: : :::| | | | | : ::| | : :| |: | : | 
Db 202 E KI RK SL IQ I ATT VI TN LD FVQNE CLDVLF YH 1 1 NPQRSNFAEARALAE DIIR 254 

Qy 201 ACIANFFNQVLVLGRSSVSDLSEH VFDLI QELFAI DPHLLLSV 243 

:| | : : Ih- :: |:::: |' : h 

Db 2 55 SCSDNESDTLANSIRSTMTAAAKEGKLPEEFELTGSSNRSKFFEVLRYLHYVSFDLVSGA 314 

Qy 2 44 MPQLEFKLKSNDGEERLAVVRLLAKLFGSKDSDLA- TQNRPLWQCFLGRFNDIHVPVRLE 3 02 

: :|:| |:| : : | | :: | | I I I II I III 

Db 315 IQELKFWLQSENEQYRKEAVTWGMLTRDKHCQFGMDSNDPTWSAFLNASIDQDDSVRHE 374 
GenCore version 5.1.9
Copyright (c) 1993 - 2006 Biocceleration Ltd.

OM protein - protein search, using sw model

Run on:         July 21, 2006, 21:02:48 ; Search time 327 Seconds
                                           (without alignments)
                                           3668.946 Million cell updates/sec

Title:          US-10-679-580A-2
Perfect score:  6650
Sequence:       1 MIKRLKMVVKTFMDMDQDSE......KLQDLAKKAAPAERQIDLQR 1297

Scoring table:  BLOSUM62
                Gapop 10.0 , Gapext 0.5

Searched:       2849598 seqs, 925015592 residues

Total number of hits satisfying chosen parameters:      2849598

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries

Database :       UniProt_7.2:*
                 1: uniprot_sprot:*
                 2: uniprot_trembl:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



                   %
Result           Query
   No.    Score  Match  Length  DB  ID            Description
     1     6650  100.0    1297   2  Q8NG14_HUMAN  Q8ng14 homo sapien
     2     6643   99.9    1297   2  Q68DF7_HUMAN  Q68df7 homo sapien
     3   6232.5   93.7    1356   2  Q5F3V3_CHICK  Q5f3v3 gallus gall
     4     5851   88.0    1323   2  Q4KLU7_XENLA  Q4klu7 xenopus lae
     5     5849   88.0    1323   2  Q4QXM3_XENLA  Q4qxm3 xenopus lae
     6     5462   82.1    1122   2  Q6A026_MOUSE  Q6a026 mus musculu
     7   4694.5   70.6    1446   2  Q4VA53_MOUSE  Q4va53 mus musculu
     8   4682.5   70.4    1447   2  Q9NTI5_HUMAN  Q9nti5 homo sapien
     9   4682.5   70.4    1483   2  Q9Y2I5_HUMAN  Q9y2i5 homo sapien
    10     4678   70.3    1262   2  Q5F3U9_CHICK  Q5f3u9 gallus gall
    11   4675.5   70.3    1446   2  Q7TSS4_MOUSE  Q7tss4 mus musculu
    12   4652.5   70.0    1391   2  Q9Y451_HUMAN  Q9y451 homo sapien
    13   4638.5   69.8    1448   2  Q498H0_XENLA  Q498h0 xenopus lae
    14     4593   69.1    1199   2  Q5U241_XENLA  Q5u241 xenopus lae
    15     4350   65.4     851   2  Q9Y4D4_HUMAN  Q9y4d4 homo sapien
    16   4126.5   62.1    1438   2  Q4SGI3_TETNG  Q4sgi3 tetraodon n
    17   3712.5   55.8    1191   2  Q80TM8_MOUSE  Q80tm8 mus musculu
    18     3686   55.4     912   2  Q5R3S3_HUMAN  Q5r3s3 homo sapien
    19     2946   44.3     584   2  Q8VDS0_MOUSE  Q8vds0 mus musculu
    20     2842   42.7     600   2  Q8N7J4_HUMAN  Q8n7j4 homo sapien
    21     2838   42.7     600   2  Q2TTR5_HUMAN  Q2ttr5 homo sapien
    22   2612.5   39.3     965   2  Q4QXM2_XENLA  Q4qxm2 xenopus lae
    23   2456.5   36.9     701   2  Q8BJ18_MOUSE  Q8bj18 mus musculu
    24   2118.5   31.9     824   2  Q8BLH6_MOUSE  Q8blh6 mus musculu
    25     2034   30.6     529   2  Q8IXT6_HUMAN  Q8ixt6 homo sapien
    26     2033   30.6     454   2  Q4T0S9_TETNG  Q4t0s9 tetraodon n
    27   2030.5   30.5    1130   2  Q7PRK5_ANOGA  Q7prk5 anopheles g
    28     1911   28.7    1218   2  Q9V646_DROME  Q9v646 drosophila
    29   1744.5   26.2     432   2  Q7SXH3_BRARE  Q7sxh3 brachydanio
    30     1715   25.8     333   2  Q96DB6_HUMAN  Q96db6 homo sapien
    31     1047   15.7     242   2  Q6TRW4_RAT    Q6trw4 rattus norv
    32      999   15.0     198   2  Q9D337_MOUSE  Q9d337 mus musculu
    33    997.5   15.0     533   2  Q5W0K8_HUMAN  Q5w0k8 homo sapien
    34    925.5   13.9     202   2  Q4SZB7_TETNG  Q4szb7 tetraodon n
    35    856.5   12.9    1561   2  Q5VMT5_ORYSA  Q5vmt5 oryza sativ
    36    785.5   11.8    1250   2  Q4P823_USTMA  Q4p823 ustilago ma
    37      779   11.7    1205   1  PDS5_SCHPO    Q9hff5 schizosacch
    38    769.5   11.6    1638   2  Q9FIL0_ARATH  Q9fil0 arabidopsis
    39    725.5   10.9    1279   2  Q55KR8_CRYNE  Q55kr8 cryptococcu
    40  [score, match, length, DB and ID illegible]  cryptococcu
    41    701.5   10.5    1481   2  Q2UB28_ASPOR  Q2ub28 aspergillus
    42      696   10.5    1690   2  Q60YT4_CAEBR  Q60yt4 caenorhabdi
    43    690.5   10.4    1523   2  Q4I755_GIBZE  Q4i755 gibberella
    44    685.5   10.3    1492   2  Q4WZI1_ASPFU  Q4wzi1 aspergillus
    45    673.5   10.1    1596   2  Q9UVY6_SORMA  Q9uvy6 sordaria ma


ALIGNMENTS 



RESULT 1 
Q8NG14_HUMAN
ID   Q8NG14_HUMAN   PRELIMINARY;   PRT;  1297 AA.
AC   Q8NG14;
DT   01-OCT-2002, integrated into UniProtKB/TrEMBL.
DT   01-OCT-2002, sequence version 1.
DT   07-FEB-2006, entry version 13.
DE   SCC-112.
GN   Name=SCC-112;
OS   Homo sapiens (Human).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; Hominidae;
OC   Homo.
OX   NCBI_TaxID=9606;
RN   [1]
RP   NUCLEOTIDE SEQUENCE.
RX   PubMed=15019998; DOI=10.1016/j.gene.2003.12.013;
RA   Kumar D., Sakabe I., Patel S., Zhang Y., Ahmad I., Gehan E.A.,
RA   Whiteside T.L., Kasid U.;
RT   "SCC-112, a novel cell cycle-regulated molecule, exhibits reduced
RT   expression in human renal carcinomas.";
RL   Gene 328:187-196(2004).
CC
CC   Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution-NoDerivs License
CC
DR   EMBL; AF294791; AAM82347.1; -; mRNA.
DR   Ensembl; ENSG00000121892; Homo sapiens.
DR   GO; GO:0005488; F:binding; IEA.
DR   InterPro; IPR011989; ARM-like.
DR   InterPro; IPR000357; HEAT.
DR   Pfam; PF02985; HEAT; 1.
SQ   SEQUENCE   1297 AA;  146610 MW;  A42D13E68CA5808E CRC64;

  Query Match             100.0%;  Score 6650;  DB 2;  Length 1297;
  Best Local Similarity   100.0%;  Pred. No. 0;
  Matches 1297;  Conservative    0;  Mismatches    0;  Indels    0;  Gaps    0;

Qy      1 MIKRLKMVVKTFMDMDQDSEDEKQQYLPLALHLASEFFLRNPNKDVRLLVACCLADIFRI 60
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db      1 MIKRLKMVVKTFMDMDQDSEDEKQQYLPLALHLASEFFLRNPNKDVRLLVACCLADIFRI 60

Qy     61 YAPEAPYTSHDKLKDIFLFITRQLKGLEDTKSPQFNRYFYLLENLAWVKSYNICFELEDC 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     61 YAPEAPYTSHDKLKDIFLFITRQLKGLEDTKSPQFNRYFYLLENLAWVKSYNICFELEDC 120

Qy    121 NEIFIQLFRTLFSVINNSHNKKVQMHMLDLMSSIIMEGDGVTQELLGSILINLIPAHKNL 180
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    121 NEIFIQLFRTLFSVINNSHNKKVQMHMLDLMSSIIMEGDGVTQELLGSILINLIPAHKNL 180

Qy    181 NKQSFDLAKVLLKRTVQTIEACIANFFNQVLVLGRSSVSDLSEHVFDLIQELFAIDPHLL 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    181 NKQSFDLAKVLLKRTVQTIEACIANFFNQVLVLGRSSVSDLSEHVFDLIQELFAIDPHLL 240

Qy    241 LSVMPQLEFKLKSNDGEERLAVVRLLAKLFGSKDSDLATQNRPLWQCFLGRFNDIHVPVR 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    241 LSVMPQLEFKLKSNDGEERLAVVRLLAKLFGSKDSDLATQNRPLWQCFLGRFNDIHVPVR 300

Qy    301 LESVKFASHCLMNHPDLAKDLTEYLKVRSHDPEEAIRHDVIVTIITAAKRDLALVNDQLL 360
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    301 LESVKFASHCLMNHPDLAKDLTEYLKVRSHDPEEAIRHDVIVTIITAAKRDLALVNDQLL 360

Qy    361 GFVRERTLDKRWRVRKEAMMGLAQLYKKYCLHGEAGKEAAEKVSWIKDKLLHIYYQNSID 420
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    361 GFVRERTLDKRWRVRKEAMMGLAQLYKKYCLHGEAGKEAAEKVSWIKDKLLHIYYQNSID 420

Qy    421 DKLLVEKIFAQYLVPHNLETEERMKCLYYLYASLDPNAVKALNEMWKCQNMLRSHVRELL 480
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    421 DKLLVEKIFAQYLVPHNLETEERMKCLYYLYASLDPNAVKALNEMWKCQNMLRSHVRELL 480

Qy    481 DLHKQPTSEANCSAMFGKLMTIAKNLPDPGKAQDFVKKFNQVLGDDEKLRSQLELLISPT 540
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    481 DLHKQPTSEANCSAMFGKLMTIAKNLPDPGKAQDFVKKFNQVLGDDEKLRSQLELLISPT 540

Qy    541 CSCKQADICVREIARKLANPKQPTNPFLEMVKFLLERIAPVHIDSEAISALVKLMNKSIE 600
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    541 CSCKQADICVREIARKLANPKQPTNPFLEMVKFLLERIAPVHIDSEAISALVKLMNKSIE 600

Qy    601 GTADDEEEGVSPDTAIRSGLELLKVLSFTHPTSFHSAETYESLLQCLRMEDDKVAEAAIQ 660
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    601 GTADDEEEGVSPDTAIRSGLELLKVLSFTHPTSFHSAETYESLLQCLRMEDDKVAEAAIQ 660

Qy    661 IFRNTGHKIETDLPQIRSTLIPILHQKAKRGTPHQAKQAVHCIHAIFTNKEVQLAQIFEP 720
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    661 IFRNTGHKIETDLPQIRSTLIPILHQKAKRGTPHQAKQAVHCIHAIFTNKEVQLAQIFEP 720

Qy    721 LSRSLNADVPEQLITPLVSLGHISMLAPDQFASPMKSVVANFIVKDLLMNDRSTGEKNGK 780
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    721 LSRSLNADVPEQLITPLVSLGHISMLAPDQFASPMKSVVANFIVKDLLMNDRSTGEKNGK 780

Qy    781 LWSPDEEVSPEVLAKVQAIKLLVRWLLGMKNNQSKSANSTLRLLSAMLVSEGDLTEQKRI 840
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    781 LWSPDEEVSPEVLAKVQAIKLLVRWLLGMKNNQSKSANSTLRLLSAMLVSEGDLTEQKRI 840

Qy    841 SKSDMSRLRLAAGSAIMKLAQEPCYHEIITPEQFQLCALVINDECYQVRQIFAQKLHKAL 900
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    841 SKSDMSRLRLAAGSAIMKLAQEPCYHEIITPEQFQLCALVINDECYQVRQIFAQKLHKAL 900

Qy    901 VKLLLPLEYMAIFALCAKDPVKERRAHARQCLLKNISIRREYIKQNPMATEKLLSLLPEY 960
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    901 VKLLLPLEYMAIFALCAKDPVKERRAHARQCLLKNISIRREYIKQNPMATEKLLSLLPEY 960

Qy    961 VVPYMIHLLAHDPDFTRSQDVDQLRDIKECLWFMLEVLMTKNENNSHAFMKKMAENIKLT 1020
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    961 VVPYMIHLLAHDPDFTRSQDVDQLRDIKECLWFMLEVLMTKNENNSHAFMKKMAENIKLT 1020

Qy   1021 RDAQSPDESKTNEKLYTVCDVALCVINSKSALCNADSPKDPVLPMKFFTQPEKDFCNDKS 1080
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1021 RDAQSPDESKTNEKLYTVCDVALCVINSKSALCNADSPKDPVLPMKFFTQPEKDFCNDKS 1080

Qy   1081 YISEETRVLLLTGKPKPAGVLGAVNKPLSATGRKPYVRSTGTETGSNINVNSELNPSTGN 1140
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1081 YISEETRVLLLTGKPKPAGVLGAVNKPLSATGRKPYVRSTGTETGSNINVNSELNPSTGN 1140

Qy   1141 RSREQSSEAAETGVSENEENPVRIISVTPVKNIDPVKNKEINSDQATQGNISSDRGKKRT 1200
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1141 RSREQSSEAAETGVSENEENPVRIISVTPVKNIDPVKNKEINSDQATQGNISSDRGKKRT 1200

Qy   1201 VTAAGAENIQQKTDEKVDESGPPAPSKPRRGRRPKSESQGNATKNDDLNKPINKGRKRAA 1260
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1201 VTAAGAENIQQKTDEKVDESGPPAPSKPRRGRRPKSESQGNATKNDDLNKPINKGRKRAA 1260

Qy   1261 VGQESPGGLEAGNAKAPKLQDLAKKAAPAERQIDLQR 1297
          |||||||||||||||||||||||||||||||||||||
Db   1261 VGQESPGGLEAGNAKAPKLQDLAKKAAPAERQIDLQR 1297



RESULT 2
Q68DF7_HUMAN
ID   Q68DF7_HUMAN   PRELIMINARY;   PRT;  1297 AA.
AC   Q68DF7;
DT   11-OCT-2004, integrated into UniProtKB/TrEMBL.
DT   11-OCT-2004, sequence version 1.
DT   07-FEB-2006, entry version 12.
DE   Hypothetical protein DKFZp686B19246.
GN   Name=DKFZp686B19246;
OS   Homo sapiens (Human).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; Hominidae;
OC   Homo.
OX   NCBI_TaxID=9606;
RN   [1]
RP   NUCLEOTIDE SEQUENCE.
RC   TISSUE=Salivary gland;
RG   The German cDNA Consortium;
RA   Koehrer K., Beyer A., Mewes H.W., Weil B., Amid C., Osanger A.,
RA   Fobo G., Han M., Wiemann S.;
RL   Submitted (AUG-2004) to the EMBL/GenBank/DDBJ databases.
CC
CC   Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution-NoDerivs License
CC
DR   EMBL; CR749425; CAH18263.1; -; mRNA.
DR   Ensembl; ENSG00000121892; Homo sapiens.
DR   GO; GO:0005488; F:binding; IEA.
DR   InterPro; IPR011989; ARM-like.
DR   InterPro; IPR000357; HEAT.
DR   Pfam; PF02985; HEAT; 1.
KW   Hypothetical protein.
SQ   SEQUENCE   1297 AA;  146668 MW;  3F6FC3D79DCE238D CRC64;

  Query Match             99.9%;  Score 6643;  DB 2;  Length 1297;
  Best Local Similarity   99.9%;  Pred. No. 0;
  Matches 1296;  Conservative    0;  Mismatches    1;  Indels    0;  Gaps    0;

Qy      1 MIKRLKMVVKTFMDMDQDSEDEKQQYLPLALHLASEFFLRNPNKDVRLLVACCLADIFRI 60
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db      1 MIKRLKMVVKTFMDMDQDSEDEKQQYLPLALHLASEFFLRNPNKDVRLLVACCLADIFRI 60

Qy     61 YAPEAPYTSHDKLKDIFLFITRQLKGLEDTKSPQFNRYFYLLENLAWVKSYNICFELEDC 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     61 YAPEAPYTSHDKLKDIFLFITRQLKGLEDTKSPQFNRYFYLLENLAWVKSYNICFELEDC 120

Qy    121 NEIFIQLFRTLFSVINNSHNKKVQMHMLDLMSSIIMEGDGVTQELLGSILINLIPAHKNL 180
          |||||||||||||||||||||||||||||||||||||||||||||| |||||||||||||
Db    121 NEIFIQLFRTLFSVINNSHNKKVQMHMLDLMSSIIMEGDGVTQELLDSILINLIPAHKNL 180

Qy    181 NKQSFDLAKVLLKRTVQTIEACIANFFNQVLVLGRSSVSDLSEHVFDLIQELFAIDPHLL 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    181 NKQSFDLAKVLLKRTVQTIEACIANFFNQVLVLGRSSVSDLSEHVFDLIQELFAIDPHLL 240

Qy    241 LSVMPQLEFKLKSNDGEERLAVVRLLAKLFGSKDSDLATQNRPLWQCFLGRFNDIHVPVR 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    241 LSVMPQLEFKLKSNDGEERLAVVRLLAKLFGSKDSDLATQNRPLWQCFLGRFNDIHVPVR 300

Qy    301 LESVKFASHCLMNHPDLAKDLTEYLKVRSHDPEEAIRHDVIVTIITAAKRDLALVNDQLL 360
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    301 LESVKFASHCLMNHPDLAKDLTEYLKVRSHDPEEAIRHDVIVTIITAAKRDLALVNDQLL 360

Qy    361 GFVRERTLDKRWRVRKEAMMGLAQLYKKYCLHGEAGKEAAEKVSWIKDKLLHIYYQNSID 420
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    361 GFVRERTLDKRWRVRKEAMMGLAQLYKKYCLHGEAGKEAAEKVSWIKDKLLHIYYQNSID 420

Qy    421 DKLLVEKIFAQYLVPHNLETEERMKCLYYLYASLDPNAVKALNEMWKCQNMLRSHVRELL 480
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    421 DKLLVEKIFAQYLVPHNLETEERMKCLYYLYASLDPNAVKALNEMWKCQNMLRSHVRELL 480

Qy    481 DLHKQPTSEANCSAMFGKLMTIAKNLPDPGKAQDFVKKFNQVLGDDEKLRSQLELLISPT 540
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    481 DLHKQPTSEANCSAMFGKLMTIAKNLPDPGKAQDFVKKFNQVLGDDEKLRSQLELLISPT 540

Qy    541 CSCKQADICVREIARKLANPKQPTNPFLEMVKFLLERIAPVHIDSEAISALVKLMNKSIE 600
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    541 CSCKQADICVREIARKLANPKQPTNPFLEMVKFLLERIAPVHIDSEAISALVKLMNKSIE 600

Qy    601 GTADDEEEGVSPDTAIRSGLELLKVLSFTHPTSFHSAETYESLLQCLRMEDDKVAEAAIQ 660
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    601 GTADDEEEGVSPDTAIRSGLELLKVLSFTHPTSFHSAETYESLLQCLRMEDDKVAEAAIQ 660

Qy    661 IFRNTGHKIETDLPQIRSTLIPILHQKAKRGTPHQAKQAVHCIHAIFTNKEVQLAQIFEP 720
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    661 IFRNTGHKIETDLPQIRSTLIPILHQKAKRGTPHQAKQAVHCIHAIFTNKEVQLAQIFEP 720

Qy    721 LSRSLNADVPEQLITPLVSLGHISMLAPDQFASPMKSVVANFIVKDLLMNDRSTGEKNGK 780
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    721 LSRSLNADVPEQLITPLVSLGHISMLAPDQFASPMKSVVANFIVKDLLMNDRSTGEKNGK 780

Qy    781 LWSPDEEVSPEVLAKVQAIKLLVRWLLGMKNNQSKSANSTLRLLSAMLVSEGDLTEQKRI 840
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    781 LWSPDEEVSPEVLAKVQAIKLLVRWLLGMKNNQSKSANSTLRLLSAMLVSEGDLTEQKRI 840

Qy    841 SKSDMSRLRLAAGSAIMKLAQEPCYHEIITPEQFQLCALVINDECYQVRQIFAQKLHKAL 900
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    841 SKSDMSRLRLAAGSAIMKLAQEPCYHEIITPEQFQLCALVINDECYQVRQIFAQKLHKAL 900

Qy    901 VKLLLPLEYMAIFALCAKDPVKERRAHARQCLLKNISIRREYIKQNPMATEKLLSLLPEY 960
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    901 VKLLLPLEYMAIFALCAKDPVKERRAHARQCLLKNISIRREYIKQNPMATEKLLSLLPEY 960

Qy    961 VVPYMIHLLAHDPDFTRSQDVDQLRDIKECLWFMLEVLMTKNENNSHAFMKKMAENIKLT 1020
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    961 VVPYMIHLLAHDPDFTRSQDVDQLRDIKECLWFMLEVLMTKNENNSHAFMKKMAENIKLT 1020

Qy   1021 RDAQSPDESKTNEKLYTVCDVALCVINSKSALCNADSPKDPVLPMKFFTQPEKDFCNDKS 1080
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1021 RDAQSPDESKTNEKLYTVCDVALCVINSKSALCNADSPKDPVLPMKFFTQPEKDFCNDKS 1080

Qy   1081 YISEETRVLLLTGKPKPAGVLGAVNKPLSATGRKPYVRSTGTETGSNINVNSELNPSTGN 1140
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1081 YISEETRVLLLTGKPKPAGVLGAVNKPLSATGRKPYVRSTGTETGSNINVNSELNPSTGN 1140

Qy   1141 RSREQSSEAAETGVSENEENPVRIISVTPVKNIDPVKNKEINSDQATQGNISSDRGKKRT 1200
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1141 RSREQSSEAAETGVSENEENPVRIISVTPVKNIDPVKNKEINSDQATQGNISSDRGKKRT 1200

Qy   1201 VTAAGAENIQQKTDEKVDESGPPAPSKPRRGRRPKSESQGNATKNDDLNKPINKGRKRAA 1260
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1201 VTAAGAENIQQKTDEKVDESGPPAPSKPRRGRRPKSESQGNATKNDDLNKPINKGRKRAA 1260

Qy   1261 VGQESPGGLEAGNAKAPKLQDLAKKAAPAERQIDLQR 1297
          |||||||||||||||||||||||||||||||||||||
Db   1261 VGQESPGGLEAGNAKAPKLQDLAKKAAPAERQIDLQR 1297




RESULT 3
Q5F3V3_CHICK
ID   Q5F3V3_CHICK   PRELIMINARY;   PRT;  1356 AA.
AC   Q5F3V3;
DT   15-MAR-2005, integrated into UniProtKB/TrEMBL.
DT   15-MAR-2005, sequence version 1.
DT   07-FEB-2006, entry version 4.
DE   Hypothetical protein.
GN   ORFNames=RCJMB04_6f4;
OS   Gallus gallus (Chicken).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archosauria; Aves; Neognathae; Galliformes; Phasianidae; Phasianinae;
OC   Gallus.
OX   NCBI_TaxID=9031;
RN   [1]
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=CB; TISSUE=Bursa;
RA   Caldwell R.B., Kierzek A.M., Arakawa H., Bezzubov Y., Zaim J.,
RA   Fiedler P., Kutter S., Blagodatski A., Kostovska D., Koter M.,
RA   Plachy J., Carninci P., Hayashizaki Y., Buerstedde J.M.;
RT   "Full-length cDNAs from chicken bursal lymphocytes to facilitate
RT   gene function analysis.";
RL   Genome Biol. 6:R6-R6(2005).
CC
CC   Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution-NoDerivs License
CC
DR   EMBL; AJ851547; CAH65181.1; -; mRNA.
DR   GO; GO:0005488; F:binding; IEA.
DR   InterPro; IPR011989; ARM-like.
DR   InterPro; IPR000357; HEAT.
DR   Pfam; PF02985; HEAT; 1.
KW   Hypothetical protein.
SQ   SEQUENCE   1356 AA;  153085 MW;  4C9D9F2FB6DA9C42 CRC64;

  Query Match             93.7%;  Score 6232.5;  DB 2;  Length 1356;
  Best Local Similarity   93.5%;  Pred. No. 0;
  Matches 1213;  Conservative   36;  Mismatches   46;  Indels    3;  Gaps    3;



Qy 


i 


MIKRLKMWKT FMDMDQDSEDEKQQYLPLALHLASEFFLRNPNKDVRLLVACCLADIFRI 


60 






-1 II II II II INI II II II III II II Mil Mil II II II II II II II II II II MM 




Db 


61 


VVKRLKMWKT FMDMDQDSEDEKQ QYLPLALHLASEFFLRNPNKDVRLLVACCLADIFRI 


120 


Qy 


61 


YAPEAPYTSHDKLKDIFLFITRQLKGLEDTKSPQFNRY FYLLENLAWVKS YNICFELEDC 


120 






1 II II MM II II II II M M Mill II II II II II II II II II II II II M II II M II 




Db 


121 


YAP EAPYTS HD KLKD I FLFITRQLKGLEDTKSPQFNRY FYLLENLAWVKS YNICFELEDC 


180 


Qy 


121 


NEIFIQLFRTLFSVINNSHNKKVQMHMLDLMSSIIMEGDGVTQELLGSILINLI PAHKNL 


180 






1 M II II M II II II II II M III II II II MM II M II M II II IMMMMMM 




Db 


181 


NEIFIQLFRTLFSVINNSHNQKVQMHMLDLMSSI IMEGDGVTQELLDSILINLI PAHKNL 


240 


Qy 


181 


NKQSFDLAKVLLKRTVQTIEACIANFFNQVLVLGRSSVSDLSEHVFDLIQELFAIDPHLL 


240 






1 II M II II II II MM II 1 II MM M II II M M M M M M M llllllllil 




Db 241 NKQAFDLAKVLLKRTVQTIEPCIANFFNQVLVLGKSSVSDLSEHVFDLILELFAIDPHLL 300

Qy 241 LSVMPQLEFKLKSNDGEERLAWRLLAKLFGSKDSDLATQNRPLWQCFLGRFNDIHVPVR 300






1 II II II II II II II II II II III II II MM II II II II II II II II II II II II II II 




Db 301 LSVMPQLEFKLKSNDGEERLAWRLLAKLFGSKDSDLATQNRPLWQCFLGRFNDIHVPVR 360

Qy 301 LESVKFASHCLMNHPDLAKDLTEYLKVRSHDPEEAIRHDVIVTIITAAKRDLALVNDQLL 360






1 M II II II II II II II II II III II II M II M II II II II II II 1 IIMMIIMII 




Db 361 LESVKFASHCLMNHPDLAKDLTEYLKVRSHDPEEAIRHDVIVTIITAGKRDLSLVNDQLL 420

Qy 361 GFVRERTLDKRWRVRKEAMMGLAQLYKKYCLHGEAGKEAAEKVSWIKDKLLHIYYQNSID 420






1 II II M II II II M II II II Ml II II II II 1 II MM II II M II II II M II II II 




Db 421 GFVRERTLDKRWRVRKEAMMGLAQLYKKYCLHAEAGKDAAEKVSWIKDKLLHIYYQNSID 480

Qy 421 DKLLVEKIFAQYLVPHNLETEERMKCLYYLYASLDPNAVKALNEMWKCQNMLRSHVRELL 480



lllllrll II IIMIIIII IMIIUIMI 1 ll'll III IMIIIIMM 
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Db 4 81 DKLLVEKIFAQYLVPHNLETEERM KCLYYLYASLDPNAVKALNEMWKCQNMLRSHVRELL 540 

Qy 481 DLHKQPTSEANCSAMFGKLMTIAKNLPDPGKAQDFVKKFNQVLGDDEKLRSQLELLISPT 540

lllllllllll :|||llllllll II II II II M II MM II II M II I! II II II II h 

Db 541 DLHKQPTSEANSAAMFGKLMTIAKNLPDPGKAQDFVKKFNQVLGDDEKLRSQLELLISPT 600

Qy 541 CSCKQADICVREIARKLANPKQPTNPFLEMVKFLLERIAPVHIDSEAISALVKLMNKSIE 600

I MINI M M M II M M II Ml M II II M II M M II M IMMI M M II II II M 

Db 601 CSCKQADVCVREIARKLANPKQPTNPFLEMVKFLLERIAPVHIDSEAISALVKLMNKSIE 660 

Qy 601 GTADDEEEGVSPDTAIRSGLELLKVLSFTHPTSFHSAETYESLLQCLRMEDDKVAEAAIQ 660

II II II II II II II II M II Mill II II II II II II II II II il M II II M M II II 

Db 661 GTADDEEEGVSPDTAIRAGLELLKVLSFTHPTSFHSAETYESLLQCLRMEDDKVAEAAIQ 720

Qy 661 IFRNTGHKIETDLPQIRSTLIPILHQKAKRGTPHQAKQAVHCIHAIFTNKEVQLAQIFEP 720

lllllll MM II MINI II III II MMIIMMMMIIMMMIIMM MM II 

Db 721 IFRNTGHKIETDLPQIRSTLIPILHQKAKRGTPHQAKQAVHCIHAIFSNKEVQLAQIFEP 780

Qy 721 LSRSLNADVPEQLITPLVSLGHISMLAPDQFASPMKSVVANFIVKDLLMNDRSTGEKNGK 780

I II II II 1 1 M 1 1 II 1 1 1 1 II II I II II II 1 1 1 1 1 1 1 1 II 1 1 M II II II 1 1 1 1 1 1 1 1 1 1 

Db 781 LSRSLNADVPEQLITPLVSLGHISMLAPDQFASPMKSVVANFWKDLLMNDRSTGEKNGK 840

Qy 781 LWSPDEEVSPEVLAKVQAIKLLVRWLLGMKNNQSKSANSTLRLLSAMLVSEGDLTEQKRI 840

IIIIIMM II II MM II Mill MMMMMMMMMMMMIIMM MM II 

Db 841 LWSPDEEVSPEVLAKVQAIKLLVRWLLGMKNNQSKSANSTLRLLSAMLVSEGDLTEQKRI 900

Qy 841 SKSDMSRLRLAAGSAIMKLAQEPCYHEIITPEQFQLCALVINDECYQVRQIFAQKLHKAL 900

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 901 SKSDMSRLRLAAGSAIMKLAQEPCYHEIITPEQFQLCALVINDECYQVRQIFAQKLHKAL 960

Qy 901 VKLLLPLEYMAIFALCAKDPVKERRAHARQCLLKNISIRREYIKQNPMATEKLLSLLPEY 960

I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I II I II I I I I I I I I I I I I I I 

Db 961 VKLLLPLEYMAIFALCAKDPVKERRAHARQCLLKNISIRREYIKQNPMANEKLLSLLPEY 1020

Qy 961 VVPYMIHLLAHDPDFTRSQDVDQLRDIKECLWFMLEVLMTKNENNSHAFMKKMAENIKLT 1020

I I I I I II II I I I I I I M I M I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1021 VVPYMIHLLAHDPDFTKPQDVDQLRDVKECLWFMLEVLMTKNENNSHAFMKKMAENIKLT 1080

Qy 1021 RDAQSPDESKTNEKLYTVCDVALCVINSKSALCNADSPKDPVLPMKFFTQPEKDFCNDKS 1080

MINIM I MINI M II III M II II II II II II II II II MINIUM II- 

Db 1081 RDAQSPDEPKANEKLYTVCDVALCVINSKSALCNADSPKDPVLPTKFFTQPEKDFSNDRN 1140

Qy 1081 YISEETRVLLLTGKPKPAGVLGAVNKPLSATGRKPYVRSTGTETGSNINVNSELNPSTGN 1140

I II II II M II II II II III I II II II M MUM M MM II II M II IM I II 

Db 1141 YISEETRVLLLTGKPKPTGVLDTVNKPLSATGRRPYIRTTGSETGSNISVNSELSSSAGN 1200

Qy 1141 RSREQSSEAAETGVSENEENPVRIISVTPVKNIDPVKNKEINSDQATQGNISSDRGKKRT 1200

I I I II II : : I I II I I I = I I I I I I I I I I I. I M I II I I I I II I I I I II h : I I I I II 
Db 1201 RSREQSSDISETGVSENDENPVRIISVTPAKT-EPVKNKEINSDQATQGN-STERGKKRT 1258

Qy 1201 VTAAGAENIQQKTDE-KVDESGPPAPSKPRRGRRPKSESQGNATKNDDLNKPINKGRKRA 1259

MM Ml II :| IMII : | MM II I II lh: III NUN 
Db 1259 ATASGTENIHQKAEENNADETGPSLAAKTRRGRPPKPEPQGTTAKNEETNKPPVRGRKRA 1318

Qy 1260 AVGQESPGGLEAGNAKAPKLQDLAKKAAPAERQIDLQR 1297 

I IMII IMIIMIN II III I MINIMI 
Db 1319 AASQESPGSLEAGNAKAPKQQDTAKKPAAAQRQIDLQR 1356



RESULT 4 
Q4KLU7_XENLA
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ID Q4KLU7_XENLA PRELIMINARY; PRT ; 1323 AA . 

AC Q4KLU7;
DT 02-AUG-2005, integrated into UniProtKB/TrEMBL.
DT 02-AUG-2005, sequence version 1.
DT 07-FEB-2006, entry version 2.
DE Hypothetical protein.
OS Xenopus laevis (African clawed frog).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Amphibia; Batrachia; Anura; Mesobatrachia; Pipoidea; Pipidae;
OC Xenopodinae; Xenopus; Xenopus.
OX NCBI_TaxID=8355;
RN [1]
RP NUCLEOTIDE SEQUENCE.
RC TISSUE=Oocytes;
RX MEDLINE=22341132; PubMed=12454917; DOI=10.1002/dvdy.10174;
RA Klein S.L., Strausberg R.L., Wagner L., Pontius J., Clifton S.W.,
RA Richardson P.;
RT "Genetic and genomic tools for Xenopus research: The NIH Xenopus
RT initiative.";
RL Dev. Dyn. 225:384-391(2002).
RN [2]
RP NUCLEOTIDE SEQUENCE.
RC TISSUE=Oocytes;
RX MEDLINE=22388257; PubMed=12477932; DOI=10.1073/pnas.242603899;
RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,
RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,
RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,
RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,
RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,
RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,
RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,
RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,
RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,
RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,
RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,
RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,
RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,
RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M.,
RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E.,
RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.;
RT "Generation and initial analysis of more than 15,000 full-length human
RT and mouse cDNA sequences.";
RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002).
RN [3]
RP NUCLEOTIDE SEQUENCE.
RC TISSUE=Oocytes;
RA Klein S., Gerhard D.S.;
RL Submitted (JUL-2005) to the EMBL/GenBank/DDBJ databases.
CC
CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution-NoDerivs License
CC
DR EMBL; BC098992; AAH98992.1; -; mRNA.
KW Hypothetical protein.
SQ SEQUENCE 1323 AA; 149527 MW; CC1C1F80CFD0FFBD CRC64;

Query Match 88.0%; Score 5851; DB 2; Length 1323;
Best Local Similarity 87.6%; Pred. No. 5.8e-307;
Matches 1138; Conservative 73; Mismatches 78; Indels 10; Gaps 1;
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MIKRLKMWKTFMDMDQDSEDEKQQYLPLALHLASEFFLRNPNKDVRLLVACCLADIFRI 60 

::| II II II II II II II II hill MM II II h hll II MM II II II II II II II I! 

VVKRLKMWKTFMDMDQDSEEEKQQYLPLALHLSSDFFLRNPNKDVRLLVACCLADIFRI 92 

YAPEAPYTSHDKLKDIFLFITRQLKGLEDTKSPQFNRYFYLLENLAWVKSYNICFELEDC 120

I II II II MM II hll II II III MM II II II II II M II II M II II II II II II II 

YAPEAPYTSHDKLKEIFLFITRQLKGLEDTKSPQFNRYFYLLENLAWVKSYNICFELEDC 152

NEIFIQLFRTLFSVINNSHNKKVQMHMLDLMSSIIMEGDGVTQELLGSILINLIPAHKNL 180

I II II II hll II II II II h III II M II II II IMMMM I I 1 1 1 M I IMM 

NEIFIQLFKTLFSVINNSHNQKVQMHMLDLMSSITMEGDGVTQEQLDSILINLISAHKNL 212

NKQSFDLAKVLLKRTVQTIEACIANFFNQVLVLGRSSVSDLSEHVFDLIQELFAIDPHLL 240

MhlMMMMM IMI 1 1 1 1 1 1 1 1 1 1 1 h 1 1 1 1 1 1 1 1 1 1 M I 1 1 1 1 1 1 1 1 1 1 M 

NKQAFDLAKVLLKRTAQTIEPCIANFFNQVLVLGKSSVSDLSEHVFDLIQELFAIDPHLL 272

LSVMPQLEFKLKSNDGEERLAWRLLAKLFGSKDSDLATQNRPLWQCFLGRFNDIHVPVR 300

I I I I I II I I II I I I I I 1 I 1 I I 1 I I I I I I I I I I I I 1 I I I I I I I 11 I I I I I I II I I I I I I I I 

LSVMPQLEFKLKSNDGEERLAWRLLAKLFGSKDSDLATQNRPLWQCFLGRFNDIHVPVR 332

LESVKFASHCLMNHPDLAKDLTEYLKVRSHDPEEAIRHDVIVTIITAAKRDLALVNDQLL 360

III II M II II II MM II II Ih MM II II II II II II II II II II h II lllllll 

LESVKFASHCLMNHPDLAKDLTEFLKVRSHDPEEAIRHDVIVTIITAAKKDLFLVNDQLL 392

GFVRERTLDKRWRVRKEAMMGLAQLYKKYCLHGEAGKEAAEKVSWIKDKLLHIYYQNSID 420

I II II II I! II II II M M II Ml M II II M II I h II II M II II M II II II II II 

GFVRERTLDKRWRVRKEAMMGLAQLYKKYCLHGEGGKDAAEKVSWIKDKLLHIYYQNSID 452

DKLLVEKIFAQYLVPHNLETEERMKCLYYLYASLDPNAVKALNEMWKCQNMLRSHVRELL 480

1 1 1 1 1 1 1 1 1 1 1 Ml II II II Ml MM II II II II II M II II M II II M M II II II 

DKLLVEKIFAQQLVPHNLETEERMKCLYYLYASLDPNAVKALNEMWKCQNMLRSHVRELL 512

DLHKQPTSEANCSAMFGKLMTIAKNLPDPGKAQDFVKKFNQVLGDDEKLRSQLELLISPT 540

MMMMIM MM 1 1 1 h 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 h 1 1 1 1 1 II I h 1 1 1 h 

DLHKQPTSEANTTAMFAKLMTVAKNLPDPGKAQDFVKKFNQVLGEDEKLRSQLEVLISPS 572

CSCKQADICVREIARKLANPKQPTNPFLEMVKFLLERIAPVHIDSEAISALVKLMNKSIE 600

I II II II M MM II IMI Mill M II IMI IMI II II II II II M II IMI II II M 

CSCKQADVCWDIARKVANPKQPTNPFLEMVKFLLERIAPVHIDSEAISALVKLMNKSIE 632

GTADDEEEGVSPDTAIRSGLELLKVLSFTHPTSFHSAETYESLLQCLRMEDDKVAEAAIQ 660

I IMI II II M IMIMM II III M II IMI IMI Ml II II II IMI IMI IMI M 

GTADDEEEGVSPDSAIRAGLELLKVLSFTHPTSFHSDETYESLLQCLRMEDDKVAEAAIQ 692



MMMMMIMMMI II I II 1 1 I II 1 1 M M M I I II M I M I M M M M M M 



LSRSLNADVPEQLITPLVSLGHISMLAPDQFASPMKSVVANFIVKDLLMNDRSTGEKNGK 780

MIMM MMIMIMM II III IIMIMI II II MMIIIMI II IMI I IMMI 

LSRSLNADVPEQLVTPLVSLGHISMLAPDQFASPMKSVVANFIVKDLLMNDRSNGDKNGK 812

LWSPDEEVSPEVLAKVQAIKLLVRWLLGMKNNQSKSANSTLRLLSAMLVSEGDLTEQKRI 840

II M 1 1 > I 1 1 1 1 1 1 I II II Ml II MM IMIMM II MM II II II MM IMMI 

LWCPDEEVSPEVLAKGQAIKLLVRWLLGMKNNQSKSANSTLRLLSAMLVSEGDLTEQKRI 872

SKSDMSRLRLAAGSAIMKLAQEPCYHEIITPEQFQLCALVINDECYQVRQIFAQKLHKAL 900

I IMMIMM II M II MM III IIMIMI II II II IMIMM II M II M IMMI 

SKSDMSRLRLAAGAAIMKLAQEPCYHEIITPEQFQLCALVINDECYQVRQIFAQKLHKAL 932

901 VKLLLPLEYMAIFALCAKDPVKERRAHARQCLLKNISIRREYIKQNPMATEKLLSLLPEY 960
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